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1 Introduction: the notion of ODEs and examples 


A differential equation (Dif ferentialgleichung) is an equation for an unknown function 
that contains not only the function but also its derivatives (Ableitung). In general, the 
unknown function may depend on several variables and the equation may include various 
partial derivatives. However, in this course we consider only the differential equations 
for a function of a single real variable. Such equations are called ordinary differential 
equations! — shortly ODE (die gewohnliche Dif ferentialgleichungen). 

A most general ODE has the form 


Epa, wu) = leat 


where F’ is a given function of n + 2 variables and y = y (x) is an unknown function of a 
real variable x. The maximal order n of the derivative y in (1.1) is called the order of 
the ODE. 

The ODEs arise in many areas of Mathematics, as well as in Sciences and Engineering. 
In most applications, one needs to find explicitly or numerically a solution y («) of (1.1) 
satisfying some additional conditions. There are only a few types of the ODEs when one 
can find all the solutions. 

In Introduction we will be concerned with various examples and specific classes of 
ODEs of the first and second order, postponing the general theory to the next Chapters. 

Consider the differential equation of the first order 


y =f (x,y), (1.2) 


where y = y (x) is the unknown real-valued function of a real argument x, and f (x, y) is 
a given function of two real variables. 

Consider a couple (x,y) as a point in R? and assume that function f is defined on a 
set D C R®, which is called the domain (Definitionsbereich) of the function f and of 
the equation (1.2). Then the expression f (x,y) makes sense whenever (x,y) € D. 


Definition. A real valued function y (x) defined on an interval? J C R, is called a 
(particular) solution of (1.2) if y (a) is differentiable at any x € J, the point (2, y (x)) 
belongs to D for any x € J and the identity y' (x) = f (x, y(x)) holds for all x € I. 


The family of all particular solutions of (1.2) is called the general solution. The graph 
of a particular solution is called an integral curve of the equation. Obviously, any integral 
curve is contained in the domain D. 

Usually a given ODE cannot be solved explicitly. We will consider some classes of 
f (x,y) when one find the general solution to (1.2) in terms of indefinite integration. 


'The theory of partial differential equations, that is, the equations containing partial derivatives, is a 
topic of another lecture course. 
Here and below by an interval we mean any set of the form 


(a,b) = {tE€R:a<a<bd} 
[a,b] = {#E€R:a<ar<b} 
a,b) = {we R:a<a<b}} 
(a,b) = {@eER:a<a<}}, 


where a,b are real or too and a < b. 


Example. Assume that the function f does not depend on y so that (1.2) becomes 
y = f (x). Hence, y must be a primitive function? of f. Assuming that f is a continuous 
(stetig) function on an interval J, we obtain the general solution on J by means of the 
indefinite integration: 


y= f f@)de=F@) +e 
where F' (x) is a primitive of f(x) on J and C is an arbitrary constant. 


Example. Consider the ODE 

y =y. 
Let us first find all positive solutions, that is, assume that y(x) > 0. Dividing the ODE 
by y and noticing that 


/ 


y ! 
eho Iny ’ 
; (Iny) 


we obtain the equivalent equation 
(Iny)’ = 1. 


Solving this as in the previous example, we obtain 
Iny = fac=2+0 


whence 
y = ee? = Che," 


C 


where C, = e%. Since C € R is arbitrary, C, = e© is any positive number. Hence, any 


positive solution y has the form 
y= Ci e*, C, > 0. 


If y (a) < 0 for all x, then use 


y/ / 
* — (in(-y)) 
y 
and obtain in the same way 
Y= —Cie*, 


where C', > 0. Combine these two cases together, we obtain that any solution y () that 
remains positive or negative, has the form 


where C' > 0 or C' < 0. Clearly, C = 0 suits as well since y = 0 is a solution. The next 
plot contains the integrals curves of such solutions: 


3By definition, a primitive function of f is any function whose derivative is equal to f. 


Let us show that the family of solutions y = Ce”, C’ € R, is the general solution. 
Indeed, if y(a) is a solution that takes positive value somewhere then it is positive in 
some open interval, say J. By the above argument, y(x) = Ce” in J, where C > 0. 
Since e* 0, this solution does not vanish also at the endpoints of J. This implies that 
the solution must be positive on the whole interval where it is defined. It follows that 
y (~) = Ce® in the domain of y (x). The same applies if y (x) < 0 for some x. 

Hence, the general solution of the ODE y/ = y is y(x) = Ce” where C' € R. The 
constant C' is referred to as a parameter. It is clear that the particular solutions are 
distinguished by the values of the parameter. 


1.1 Separable ODE 
Consider a separable ODE, that is, an ODE of the form 


y' = f (x) gy). (1.3) 


Any separable equation can be solved by means of the following theorem. 


Theorem 1.1 (The method of separation of variables) Let f (x) and g (y) be continuous 
functions on open intervals I and J, respectively, and assume that g(y) £40 on J. Let 
F (x) be a primitive function of f (x) on I and G (y) be a primitive function of aa) on J. 
Then a function y defined on some subinterval of I, solves the differential equation (1.3) 
if and only if it satisfies the identity 


G(y(@)) =F (a) +¢, (1.4) 
for all x in the domain of y, where C' is a real constant. 


For example, consider again the ODE 7 = y in the domain x € R, y > 0. Then 
f (x) =1 and g(y) = y £0 so that Theorem 1.1 applies. We have 


F(a)= f f(@de= f de=s 


5 


and 


ey ec a i ae 
ow =f ay. -ooe 


where we do not write the constant of integration because we need only one primitive 
function. The equation (1.4) becomes 


Iny=2+C, 


whence we obtain y = C,e* as in the previous example. Note that Theorem 1.1 does not 
cover the case when g(y) may vanish, which must be analyzed separately when needed. 
Proof. Let y (x) solve (1.3). Since g (y) 4 0, we can divide (1.3) by g (y), which yields 


/ 


y 
Observe that by the hypothesis f(x) = F’ (x) and aw = G' (y), which implies by the 
chain rule ; 
y Ul / / 
= =(G(y(a 
a) (yy =(G(y(2))) 
Hence, the equation (1.3) is equivalent to 
G(y(2)) = F(a), (1.6) 


which implies (1.4). 

Conversely, if function y satisfies (1.4) and is known to be differentiable in its domain 
then differentiating (1.4) in x, we obtain (1.6); arguing backwards, we arrive at (1.3). 
The only question that remains to be answered is why y (x) is differentiable. Since the 
function g(y) does not vanish, it is either positive or negative in the whole domain. 
Then the function G (y), whose derivative is aay is either strictly increasing or strictly 
decreasing in the whole domain. In the both cases, the inverse function G~! is defined 


and is differentiable. It follows from (1.4) that 
y(x) =G'(F(z)+C). (1.7) 


Since both F and G=! are differentiable, we conclude by the chain rule that y is also 
differentiable, which finishes the proof. m 


Corollary. Under the conditions of Theorem 1.1, for all x) € I and yo € J there exists 
a unique value of the constant C such that the solution y (x) defined by (1.7) satisfies the 
condition y (xo) = Yo. 

The condition y (xo) = yo is called the initial condition (An fangsbedingung). 

Proof. Setting in (1.4) 7 = xp and y = yo, we obtain G (yo) = F’ (a)+C, which allows 
to uniquely determine the value of C,, that is, C = G (yo) — F(a). Conversely, assume 
that C is given by this formula and prove that it determines by (1.7) a solution y (a). If 
the right hand side of (1.7) is defined on an interval containing xo, then by Theorem 1.1 it 
defines a solution y (x), and this solution satisfies y (19) = yo by the choice of C. We only 
have to make sure that the domain of the right hand side of (1.7) contains an interval 
around 29 (a priori it may happen so that the the composite function G7! (F (x) + C) 
has empty domain). For x = x the right hand side of (1.7) is 


G™ (F (ao) + C) = G7" (G (yo)) = Yo 
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so that the function y (x) is defined at x = 29. Since both functions G~! and F' + C are 
continuous and defined on open intervals, their composition is defined on an open set. 
Since this set contains x9, it contains also an interval around x. Hence, the function y is 
defined on an interval around 2%, which finishes the proof. m 

One can rephrase the statement of Corollary as follows: for all 7 € J and yo € J 
there exists a unique solution y (x) of (1.3) that satisfies in addition the initial condition 
y (0) = yo; that is, for every point (7, yo) € J x J there is exactly one integral curve 
of the ODE that goes through this point. However, the meaning of the uniqueness claim 
in this form is a bit ambiguous because out of any solution y (x), one can make another 
solution just by slightly reducing the domain, and if the reduced domain still contains xo 
then the initial condition will be satisfied also by the new solution. The precise uniqueness 
claim means that any two solutions satisfying the same initial condition, coincide on the 
intersection of their domains; also, such solutions correspond to the same value of the 
parameter C’. 

In applications of Theorem 1.1, it is necessary to find the functions F’ and G. Techni- 
cally it is convenient to combine the evaluation of F' and G with other computations as 
follows. The first step is always dividing (1.3) by g to obtain (1.5). Then integrate the 


both sides in x to obtain ie 
ieee / x) dx. 1.8 
fas ff a8 


Then we need to evaluate the integral in the right hand side. If F(x) is a primitive of f 
then we write 


[f@ar=P@+e. 


In the left hand side of (1.8), we have y'dx = dy. Hence, we can change variables in the 
integral replacing function y (x) by an independent variable y. We obtain 


fda dy | 
[a7 tgy7c@re 


Combining the above lines, we obtain the identity (1.4). 

If in the equation y’ = f (x) g (y) the function g (y) vanishes at a sequence of points, say 
Y1, Y2, +--+, enumerated in the increasing order, then we have a family of constant solutions 
y(x) = yr. The method of separation of variables provides solutions in any domain 
Yr <Y < Yrqi. The integral curves in the domains yz < y < Yyr41 can in general touch 
the constant solutions, as will be shown in the next example. 


y = Jul, 


which is defined for all y € R. Since the right hand side vanish for y = 0, the constant 
function y = 0 is a solution. In the domains y > 0 and y < 0, the equation can be solved 
using separation of variables. For example, in the domain y > 0, we obtain 


2f/y=z2t+C 


Example. Consider the equation 


whence 


7. 


and i 
y=q@etcy, z>-C 


(the restriction x > —C' comes from the previous line). Similarly, in the domain y < 0, 


we obtain 
[ale 
=e 
whence 
—2/-y=x+C 
and 


1 
ioe aie £<—C. 


We obtain the following integrals curves: 


y ay 


We see that the integral curves in the domain y > 0 touch the curve y = 0 and so do the 
integral curves in the domain y < 0. This allows us to construct more solution as follows: 
take a solution y; (a) < 0 that vanishes at « = a and a solution y2 (x) > 0 that vanishes 
at « = b where a < 0 are arbitrary reals. Then define a new solution: 


yi (2), u<Ga 
y(x)=4 0, a<x<b, 
yo (2), r> 6. 


Note that such solutions are not obtained automatically by the method of separation of 
variables. It follows that through any point (29, yo) € R? there are infinitely many integral 
curves of the given equation. 


1.2. Linear ODE of 1st order 


Consider the ODE of the form 
y + a(x) y = b(2) (1.9) 
where a and 0 are given functions of x, defined on a certain interval J. This equation is 


called linear because it depends linearly on y and y/. 
A linear ODE can be solved as follows. 


Theorem 1.2 (The method of variation of parameter) Let functions a(x) and b(x) be 
continuous in an interval I. Then the general solution of the linear ODE (1.9) has the 
form 


y (x) =e A@) fr) eA da, (1.10) 
where A(x) is a primitive of a(x) on I. 


Note that the function y (x) given by (1.10) is defined on the full interval J. 
Proof. Let us make the change of the unknown function u (x) = y (x) eA), that: is, 


y (2) = u(x) e 4, (1.11) 
Substituting this to the equation (1.9) we obtain 
ue“)! + aue~“4 = b, 


wer =e 8A ae f= th 


Since A’ = a, we see that the two terms in the left hand side cancel out, and we end up 
with a very simple equation for u (x): 


'eo A= 


U= [ deta. 


Substituting into (1.11), we finish the proof. m 
One may wonder how one can guess to make the change (1.11). Here is the motivation. 
Consider first the case when 6 (a) = 0. In this case, the equation (1.9) becomes 


whence u! = be4 and 


y +a(xz)y=0 


and it is called homogeneous. Clearly, the homogeneous linear equation is separable. In 
the domains y > 0 and y < 0 we have 


y 
—=-a(r 
; (x) 
and j 
{2 i [ a(ajdr=-A(a) +0. 
Then In |y| = —A(x) +C and 
y (2) = Ce 


where C' can be any real (including C = 0 that corresponds to the solution y = 0). 

For a general equation (1.9) take the above solution to the homogeneous equation and 
replace a constant C' by a function C'(x) (or which was denoted by u(x) in the proof), 
which will result in the above change. Since we have replaced a constant parameter by 
a function, this method is called the method of variation of parameter. It applies to the 
linear equations of higher order as well. 


Example. Consider the equation 
1 1 x 
y+—y=e (1.12) 
a 
in the domain x > 0. Then 
d 
A(x) = [ a(@)de = [= =Inz 
Le 


(we do not add a constant C' since A (2) is one of the primitives of a(x)), 


le ee ee ae ee ee 
yie)=— fe da = = fe dx ae (c +C), 


where C' is an arbitrary constant. 
Alternatively, one can solve first the homogeneous equation 


fee 
y ry = 0, 
ne 
using the separable of variables: 
UP, Go atl 
y  « 
(Iny)’ = —(Ina) 
Iny = —-Inz+C, 
C 
UN, Sy ey 
bi 


Next, replace the constant C' by a function C (x) and substitute into (1.12): 


Oy eae 


x £e 
Cxr-C C = 
oa | 
C" 2 
i — ev 
x 
C= & x 
e Ls 
Clan = Je nde = 5 (c¢ +p) . 


Hence, 


ee), 


a Ww 
where Co is an arbitrary constant. 


Corollary. Under the conditions of Theorem 1.2, for any x € I and any yo € R there 
is exists exactly one solution y (x) defined on I and such that y (xo) = yo. 


That is, though any point (20, yo) € J x R there goes exactly one integral curve of the 
equation. 


10 


Proof. Let B(x) be a primitive of be~“ so that the general solution can be written 
in the form 
y = eA (B(x) +0) 


with an arbitrary constant C’. Obviously, any such solution is defined on J. The condition 
y (20) = yo allows to uniquely determine C' from the equation: 


G = ype) —B (x0) ; 


whence the claim follows.’ 


1.3  Quasi-linear ODEs and differential forms 


Let F (x,y) be a real valued function defined in an open set 2 C R?. Recall that F is 
differentiable at a point (x,y) € 2 if there exist real numbers a,b such that 


F(x+dz,y + dy) — F (x,y) = adx + bdy + 0(|dz| + |dyl), 


as |dx| + |dy| — 0. Here dx and dy the increments of x and y, respectively, which are 
considered as new independent variables (the differentials). The linear function adx + bdy 
of the variables dx, dy is called the differential of F' at (x,y) and is denoted by dF’, that 
is, 

dF = adx + bdy. (1.13) 


In general, a and 6 are functions of (, y). 
Recall also the following relations between the notion of a differential and partial 
derivatives: 


1. If F is differentiable at some point (x, y) and its differential is given by (1.13) then 
the partial derivatives F,, = or and Fy, = oe exist at this point and 


Ff, =a, BF, = 0. 


2. If F is continuously differentiable in 9, that is, the partial derivatives F’, and F, 
exist in Q and are continuous functions then F' is differentiable at any point in Q. 


Definition. Given two functions a (x,y) and b(x,y) in Q, consider the expression 
a(x, y) dx + (x,y) dy, 


which is called a differential form. The differential form is called exact in 2 if there is a 
differentiable function F’ in 2 such that 


dF = adz + bdy, (1.14) 


and inexact otherwise. If the form is exact then the function F from (1.14) is called the 
integral of the form. 


Observe that not every differential form is exact as one can see from the following 
statement. 
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Lemma 1.3 I[f functions a,b are continuously differentiable in Q then the necessary con- 
dition for the form adx + bdy to be exact is the identity 


Gy = dg. 


Proof. Indeed, if there is F’ is an integral of the form adz + bdy then F, = a and 
F, = 6, whence it follows that the derivatives F’, and Fi, are continuously differentiable. 
By a well-know fact from Analysis, this implies that F,, = Fy,whence ay = by. ™ 


Example. The form ydx — xdy is inexact because a, = 1 while b; = —1. 
The form ydx + ardy is exact because it has an integral F’ (x,y) = xy. 
The form 2rydx + (x? + y*) dy is exact because it has an integral F (x,y) = x?y + v 
(it will be explained later how one can obtain an integral). 
If the differential form adz + bdy is exact then this allows to solve easily the following 
differential equation: 
a(xz,y) + 6(2,y)y' =0. (1.15) 


This ODE is called quasi-linear because it is linear with respect to y’ but not neces- 
sarily linear with respect to y. Using y! = “, one can write (1.15) in the form 


~~ dx? 
a(x,y) dx + b(x,y) dy = 0, 


which explains why the equation (1.15) is related to the differential form adx + bdy. We 
say that the equation (1.15) is exact if the form adx + bdy is exact. 


Theorem 1.4 Let be an open subset of R?, a,b be continuous functions on Q, such that 
the form ada + bdy is exact. Let F' be an integral of this form. Consider a differentiable 
function y (x) defined on an interval I C R such that the graph of y is contained in Q. 
Then y solves the equation (1.15) if and only if 


F (x,y (x)) = const on I. 


Proof. The hypothesis that the graph of y(x) is contained in 2 implies that the 
composite function F (x,y (a)) is defined on J. By the chain rule, we have 


d 

—F (z,y(x)) = F, + Fy = a+ by’. 

dx 

Hence, the equation a + by’ = 0 is equivalent to 4F (x,y(x)) = 0, and the latter is 
equivalent to F'(x,y(«)) = const. m 


Example. The equation y + xy’ = 0 is exact and is equivalent to ry = C' because 
ydz +axdy = d(xy). The same can be obtained using the method of separation of variables. 
The equation 2ry + (x? + y*) y' = 0 is exact and is equivalent to 


3 
2 7] 
—=C. 
ae ae 


Below are some integral curves of this equation: 
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How to decide whether a given differential form is exact or not? A partial answer is 
given by the following theorem. 

We say that a set Q C R? is a rectangle (box) if it has the form I x J where J and J 
are intervals in R. 


Theorem 1.5 (The Poincaré lemma) Let 2 be an open rectangle in R?. Let a,b be 
continuously differentiable functions on Q such that a, = b,. Then the differential form 
adx + bdy is exact in Q. 


Proof of Theorem 1.5. Assume first that the integral F’ exists and F' (xo, yo) = 0 
for some point (29, yo) € © (the latter can always be achieved by adding a constant 
to F). For any point (2,y) € Q, also the point (x, yo) € Q; moreover, the intervals 
[(%o, Yo); (, yo)} and [(x, yo) , (x, y)] are contained in 2 because 1) is a rectangle. Since 
F,, =a and F, = b, we obtain by the fundamental theorem of calculus that 


x x 


F, (8, yo) ds = | a(s, yo) ds 


J xO 


F(x, yo) = F (x, yo) — F (#0, yo) = l 


J xO 


F (2,y) — F (2, yo) = [ Fye.nae [ v(e.tyat 


“Vo “Yo 


whence 


x 


F (x,y) = fa(s.yo)ds+ [ o(0.1) dt. (1.16) 


xo 


Now use the formula (1.16) to define function F (x,y). Let us show that F is indeed the 
integral of the form adz + bdy. Since a and b are continuous, it suffices to verify that 


F,=a and F,=b. 
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It is easy to see from (1.16) that 


Next, we have 


= = fal aso [oa t) dt 
a Ox : S, Yo) as Or Jy, wy, 


“20 


ae, 
= — t) dt. 1.1 
(ew) +f sble0 (1.17) 


The fact that the integral and the derivative 2 can be interchanged will be justified below 
(see Lemma 1.6). Using the hypothesis b, = a,, we obtain from (1.17) 


y 
pa a(x, Yo) Tv i dy (x,t) dt 
“VO 


= a(x, yo) + (a(x,y) — a(x, Yo) 
= a(z,y), 


which finishes the proof. m 
Now we prove the lemma, which is needed to justify (1.17). 


Lemma 1.6 Let g (x,t) be a continuous function on I x J where I and J are bounded 
closed intervals in IR. Consider the function 


B 
f(a) =f g(w,tha, 


where [a, 3] = J, which is defined for all x € I. If the partial derivative g, exists and is 
continuous on I x J then f is continuously differentiable on I and, for any x € I, 


B 
Aa) if Ge. (@,t) ab. 


In other words, the operations of differentiation in x and integration in t, when applied 
to g (x,t), are interchangeable. 
Proof of Lemma 1.6. We need to show that, for all x € J, 


f(w') ~ f (2) 


GS 7 


B 
/ gx (x,t) dt as x’ > 2, 


which amounts to 


P g(a',t) — g(a,t a 
freee eia | Gx (x,t) dt as av! > x. 


vu — 2 
Note that by the definition of a partial derivative, for any t € [a, (], 


g (z', t) 7g (x,t) 


; — gy (x,t) asa’ > a. (1.18) 
a —£ 
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Consider all parts of (1.18) as functions of ¢, with fixed x and with 2’ as a parameter. 
Then we have a convergence of a sequence of functions, and we would like to deduce 
that their integrals converge as well. By a result from Analysis II, this is the case, if the 
convergence is uniform (gleichmassig) in the whole interval [a, 3], that is, if 


lt) = t 
sup GES IAS 4) — gr (x,t)} +0 asa’ za. (1.19) 
te[a,6] ee) 
By the mean value theorem, for any t € [a, 6], there is € € {, x’] such that 


g (2, t) = g (x,t) 
a2 


= Ga (€, t) : 


Hence, the difference quotient in (1.19) can be replaced by g, (€,t). To proceed further, 
recall that a continuous function on a compact set is uniformly continuous. In particular, 
the function g, (x,t) is uniformly continuous on J x J, that is, for any e > 0 there is 6 > 0 
such that 


£,€el,|c—£€|<dandt,se J |t—s|<6 => |g, (z,t) — gz (E,8)| <e. (1.20) 
If | — x’| < 6 then also |x — €| < 6 and, by (1.20) with s =t, 
lgx (€,t) — ge (x, t)| << € for allte J. 
In other words, |” — 2’| < 6 implies that 


fh t 
sup |Z0st) = 9 (8) 
teJ yaa 


GAGE) Se 


whence (1.19) follows. m= 
Consider some examples to Theorem 1.5. 


Example. Consider again the differential form 2rydx + (x? + y?) dy in Q = R?. Since 


dy = (2ry), = 20 = (a? +y°), =z, 


x 


we conclude by Theorem 1.5 that the given form is exact. The integral F’ can be found 
by (1.16) taking xo = yo = 0: 


P(z,y) = | 2s0ds + i (a? +07) dt =a7y+ = 
J0 JO 
as it was observed above. 


Example. Consider the differential form 


—ydx + xdy 
page (1.21) 


in 2 = R? \ {0}. This form satisfies the condition a, = b, because 


- ( y ) 2 PRE SUP. Se 
ay = — = 
y 


P+ jie wry 
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and 


io 


_ (@+y)—20? 
ety), (arty (a +42)?" 

By Theorem 1.5 we conclude that the given form is exact in any rectangular domain in 

(2. However, let us show that the form is inexact in (2. 


Consider the function 6 (x, y) which is the polar angle that is defined in the domain 
OFS RP 4G 0) ees 0} 
by the conditions 
e 
sinf = e cosd=—-, 9€ (-7,7), 
e r 


where r = ,/a? + y?. Let us show that in 1’ 


—ydx + xdy 


In the half-plane {x > 0} we have tané = 4 and @ € (—7/2,7/2) whence 
6 = arctan 2. 
a 


Then (1.22) follows by differentiation of the arctan: 


1 rdy—ydx _ —ydx + «dy 


a 
L+(y/c)  @ w+? 


In the half-plane {y > 0} we have cot @ = 7 and @ € (0,7) whence 
6 = arccot — 


and (1.22) follows again. Finally, in the half-plane {y < 0} we have cot@ = < and 0 € 


(—7,0) whence 
@ = — arccot (- = ; 
y 


and (1.22) follows again. Since 2’ is the union of the three half-planes {a > 0}, {y > 0}, 
{y < 0}, we conclude that (1.22) holds in 9’ and, hence, the form (1.21) is exact in (’. 

Why the form (1.21) is inexact in Q? Assume from the contrary that the form (1.21) 
is exact in 2 and that F is its integral in 2, that is, 


_ —ydx + ady 


dF 
x? + y? 


Then dF = dé in Q’ whence it follows that d(F — 6) = 0 and, hence’ F = 6 + const in 
Q’. It follows from this identity that function 6 can be extended from 1’ to a continuous 


4We use the following fact from Analysis II: if the differential of a function is identical zero in a 
connected open set U C R” then the function is constant in this set. Recall that the set U is called 
connected if any two points from U can be connected by a polygonal line that is contained in U. 

The set ’ is obviously connected. 
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function on 2, which however is not true, because the limits of 6 when approaching the 
point (—1,0) (or any other point (x,0) with « < 0) from above and below are different. 

The moral of this example is that the statement of Theorem 1.5 is not true for an 
arbitrary open set 9. It is possible to show that the statement of Theorem 1.5 is true 
if and only if the set Q is simply connected, that is, if any closed curve in 2 can be 
continuously deformed to a point while staying in 2. Obviously, the rectangles are simply 
connected (as well as 9’), while the set 2 = R? \ {0} is not simply connected. 


1.4 Integrating factor 
Consider again the quasilinear equation 
a(x,y)+b(a,y)y' =0 (1.23) 


and assume that it is inexact. 
Write this equation in the form 


adx + bdy = 0. 
After multiplying by a non-zero function M (x,y), we obtain an equivalent equation 


Madz + Mbdy = 0, 


which may become exact, provided function M is suitably chosen. 
Definition. A function M (x,y) is called the integrating factor for the differential equa- 
tion (1.23) in 2 if M is a non-zero function in Q such that the form Madzx + Mbdy is 
exact in 2. 

If one has found an integrating factor then multiplying (1.23) by M the problem 
amounts to the case of Theorem 1.4. 


Example. Consider the ODE 
y 


Ag?y + x’ 
in the domain {x > 0,y > 0} and write it in the form 


ydx — (407y + x) dy = 0. 


y= 


Clearly, this equation is not exact. However, dividing it by x?, we obtain the equation 
1 
de — (4 + =) dy = 0, 


which is already exact in any rectangular domain because 


) 1 1 
2 re en MLD ry ee 
ey Sg (y+ =), 


Taking in (1.16) 2 = yo = 1, we obtain the integral of the form as follows: 


"1 Z 1 
Fwy) = | Sas— | (a+ =) M280 = 2, 
Ji § J1 x x 


By Theorem 1.4, the general solution is given by the identity 
Qy? + ee 
a 


Lf. 


1.5 Second order ODE 


A general second order ODE, resolved with respect to y” has the form 


y= Tt ayy, 


where f is a given function of three variables and y = y (x) is an unknown function. We 
consider here some problems that amount to a second order ODE. 


1.5.1 Newtons’ second law 


Consider movement of a point particle along a straight line and let its coordinate at 
time t be x(t). The velocity (Geschwindigkeit) of the particle is v(t) = a’ (t) and the 
acceleration (Beschleunigung) is a(t) = 2” (t). The Newton’s second law says that at 
any time 

mae =F, (1.24) 


where m is the mass of the particle and F' is the force (Kraft) acting on the particle. In 
general, F’ is a function of t,x, 2x’ so that (1.24) can be regarded as a second order ODE 
for x (t). 

The force F' is called conservative if F depends only on the position x. For example, 
conservative are gravitation force, spring force, electrostatic force, while friction and the 
air resistance are non-conservative as they depend in the velocity v. Assuming F’ = F (), 
denote by U (x) a primitive function of —F'(«). The function U is called the potential of 
the force F’. Multiplying the equation (1.24) by x’ and integrating in t, we obtain 


m f ate'dt = [Fo x’ dt, 


* [ Fyra= [ Fae. 
mv? 


ED Ura 
and ; 
a +U (a) =C. 


mv2 


The sum “> + U (2) is called the total energy of the particle (which is the sum of the 
kinetic energy and the potential energy). Hence, we have obtained the law of conservation 
of energy: the total energy of the particle in a conservative field remains constant. 


1.5.2. Electrical circuit 


Consider an RLC-circuit that is, an electrical circuit (Schaltung) where a resistor, an 
inductor and a capacitor are connected in a series: 
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V(t) iL 


| 
II 
C 


Denote by R the resistance (Wider stand) of the resistor, by L the inductance (Induktivitat) 
of the inductor, and by C' the capacitance (Kapazitat) of the capacitor. Let the circuit 
contain a power source with the voltage V (t) (Spannung) where t is time. Denote by 
I(t) the current (Strom) in the circuit at time t. Using the laws of electromagnetism, we 
obtain that the potential difference vp on the resistor R is equal to 


UR = RI 
(Ohm’s law), and the potential difference vz on the inductor is equal to 
dl 
= L— 
amen: 


(Faraday’s law). The potential difference vc on the capacitor is equal to 


UG SAR; 


C 


where Q is the charge (Ladungsmenge) of the capacitor; also we have Q! = I. By 
Kirchhoff’s law, we have 
Ur +vz, +c = V (t) 


whence 


RI+Lh +3 =V(). 


Differentiating in t, we obtain 


I 
LEE REE aM", (1.25) 


which is a second order ODE with respect to J (t). We will come back to this equation 
after having developed the theory of linear ODEs. 


2 Existence and uniqueness theorems 


2.1 1st order ODE 


We change notation, denoting the independent variable by ¢ and the unknown function 
by x(t). Hence, we write an ODE in the form 


ESF tae) 


19 


where f is a real value function on an open set 2 C R? and a pair (t,x) is considered as 
a point in R?. 

Let us associate with the given ODE the initial value problem (An fangswertproblem) 
- shortly, IVP, which is the problem of finding a solution that satisfies in addition the inztial 
condition x (to) = xo where (to, 79) is a given point in 2. We write IVP in a compact 


form as follows: je ee 
LT (Gea) 

A solution to IVP is a differentiable function x(t) : J — R where J is an open interval 
containing to, such that (t,x (t)) € © for all t € J, which satisfies the ODE in J and the 
initial condition. Geometrically, the graph of function x(t) is contained in 2 and goes 
through the point (to, vo). 

In order to state the main result, we need the following definitions. 
Definition. We say that a function f :Q— R is Lipschitz in x if there is a constant L 
such that 


for all t,z,y such that (t,2) € Q and (t,y) € Q. The constant L is called the Lipschitz 
constant of f in 2. 

We say that a function f : Q — R is locally Lipschitz in x if, for any point (to, 7) € Q 
there exist positive constants ¢,6 such that the rectangle 


R= [to — 6, to +6] x [a0 — €, 20 + €] (2.2) 


is contained in 2 and the function f is Lipschitz in R; that is, there is a constant LD such 
that for all t € [tp — 6, t) + 6] and x,y € [xo — €, 20 + €], 


Note that in the latter case the constant L may be different for different rectangles. 


Lemma 2.1 (a) [f the partial derivative f, exists and is bounded in a rectangle RC R? 
then f is Lipschitz in x in R. 

(b) If the partial derivative f, exists and is continuous in an open set Q C R? then f 
is locally Lipschitz in x in Q. 


Proof. (a) If (t,x) and (t, y) belong to R then the whole interval between these points 
is also in R, and we have by the mean value theorem 


f (t, 2) = f (ty) = fee (t, €) (x — y), 
for some € € [x,y]. By hypothesis, f, is bounded in R, that is, 


L:= sup |f,| <0, (2.3) 
R 


whence we obtain 
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Hence, f is Lipschitz in R with the Lipschitz constant (2.3). 

(b) Fix a point (to, 2%) € 2 and choose positive ¢,6 so small that the rectangle R 
defined by (2.2) is contained in 2 (which is possible because 2 is an open set). Since 
R is a bounded closed set, the continuous function f, is bounded on R. By part (a) we 
conclude that f is Lipschitz in R, which means that f is locally Lipschitz in Q. m 


Example. The function f (t,7) = |x| is Lipschitz in x in R? because 
Iz] —Iyll < |e — yl, 


by the triangle inequality for |x|. Clearly, f is not differentiable in x at x = 0. Hence, the 
continuous differentiability of f is sufficient for f to be Lipschitz in x but not necessary. 


The next theorem is one of the main results of this course. 


Theorem 2.2 (The Picard - Lindelof theorem) Let 2 be an open set in R? and f (t,x) 
be a continuous function in Q that is locally Lipschitz in x. 


(Existence) Then, for any point (to, to) € Q, the initial value problem IVP (2.1) has a solution. 


(Uniqueness) If x1 (t) and x2 (t) are two solutions of the same IVP then x, (t) = x9 (t) in their 
common domain. 


Remark. By Lemma 2.1, the hypothesis of Theorem 2.2 that f is locally Lipschitz in 
x could be replaced by a simpler hypotheses that f, is continuous. However, as we have 
seen above, there are examples of functions that are Lipschitz but not differentiable, and 
Theorem 2.2 applies for such functions. 

If we completely drop the Lipschitz condition and assume only that f is continuous 
in (t,a) then the existence of a solution is still the case (Peano’s theorem) while the 
uniqueness fails in general as will be seen in the next example. 


Example. Consider the equation x’ = \/|x| which was already solved before by separa- 
tion of variables. The function x (t) = 0 is a solution, and the following two functions 


8 
LoS 
+ 
Nil 
I 


t2, t>0 
4 d p) 
a(t) = sey 26 
4 p) 
are also solutions (this can also be trivially verified by substituting them into the ODE). 
Gluing together these two functions and extending the resulting function to t = 0 by 


setting x (0) = 0, we obtain a new solution defined for all real t (see the diagram below). 
Hence, there are at least two solutions that satisfy the initial condition x (0) = 0. 
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The uniqueness breaks down because the function \/|x| is not Lipschitz near 0. 
Proof of existence in Theorem 2.2. We start with the following observation. 


Claim. Let x(t) be a function defined on an open interval IC R. A function x (t) solves 
IVP if and only if x(t) is continuous, (t,x (t)) € Q for allt € I, to € I, and 


t 
elt) =a +f f (s, x (s)) ds. (2.4) 
to 
Indeed, if x solves IVP then (2.4) follows from 2’ = f (t, x (t)) just by integration: 


ie (s)ds = [3 (s, a (s)) ds 
whence | 
z(t) — 2 =| f (s,x (s)) ds. 


Conversely, if x is a continuous function that satisfies (2.4) then the right hand side of 
(2.4) is differentiable in t whence it follows that x (t) is differentiable. It is trivial that 
x (to) = 2%, and after differentiation (2.4) we obtain the ODE a’ = f (t,x). 

This claim reduces the problem of solving IVP to the integral equation (2.4). Fix a 
point (to, 7%) € Q and let ¢,6 be the parameter from the the local Lipschitz condition at 
this point; that is, there is a constant L such that 


for all t € [to — 6, to + 6] and 2, y € [xo — €, 29 +6]. Set 
J = (xp —€,2%9 +¢] and I= [tp -—1r,to +7], 


were 0 < r <6 is anew parameter, whose value will be specified later on. By construction, 
Ix JcQ. 


22 


Denote by X be the family of all continuous functions x (t) : J — J, that is, 
X ={x:I— J | xis continuous} 


(see the diagram below). 


I=[to-r,to+1] 


Consider the integral operator A defined on functions x € X by 


Az (t) =a + / f (s, x (s)) ds, 


which is obviously motivated by (2.4). To be more precise, we would like to ensure that 
x € X implies Ar € X. Note that, for any x € X, the point (s,x(s)) belongs to 2 so 
that the above integral makes sense and the function Az is defined on J. This function 
is obviously continuous. We are left to verify that the image of Az is contained in J. 
Indeed, the latter condition means that 


|Az (t) — xo| < € for all t € J. (2.5) 


We have, for any t € J, 


ot 
| Az (t) — xo| = | f (s,a(s))ds| < sup |f (s,x)||t—to| < Mr, 
J to sel ved 
where 
M= sup__|f (s,x)| < oo. 
s€[to—d,to+d] 


x€[xo—e,xz0+¢] 


Hence, if r is so small that Mr < ¢ then (2.5) is satisfied and, hence, Ax € X. 
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To summarize the above argument, we have defined a function family X and a mapping 
A:X — X. By the above Claim, a function x € X will solve the IVP if function x is a 
fixed point of the mapping A, that is, if « = Az. 

The existence of a fixed point will be obtained using the Banach fixed point theorem: 
If (X,d) is a complete metric space (Vollstandiger metrische Raum) and A: X — X is 
a contraction mapping (Kontraktionsabbildung), that is, 


d(Az, Ay) < qd(z,y) 


for some q € (0,1) and all z,y € X, then A has a fixed point. By the proof of this 
theorem, one starts with any element 29 € X, constructs a sequence of iteration {x,}°~ 
using the rule %,4; = Arp, n = 0,1,..., and shows that the sequence {x,,}°° , converges 
in X to a fixed point. 

In order to be able to apply this theorem, we must introduce a distance function d 
(Abstand) on X so that (X,d) is a complete metric space and A is a contraction mapping 
with respect to this distance. 

Let d be the sup-distance, that is, for any two functions x,y € X, set 


d(a,y) = sup |x (t) — y (t)|. 
tel 
Using the fact that the convergence in (X,d) is the uniform convergence of functions and 
the uniform limits of continuous functions is continuous, one can show that the metric 
space (X,d) is complete (see Exercise 16). 
How to ensure that the mapping A: X — X is a contraction? For any two functions 
x,y € X and any t € IJ, we have x(t), y(t) € J whence by the Lipschitz condition 


Ar (t)- Avil = ff fle(9)as— f f(s,u(s))as 


IA 
so 
i 
8 
—— 
a 
| 
Ra 
——~ 
WD 
— 
Q 
W 


IN IA 


Therefore, 
sup [Ae (f) — Ay (1) $ sup (s) — y(s)| L|é~ ta 
€ se 


whence 
d(Az, Ay) < Lrd(z,y). 


Hence, choosing r < 1/L, we obtain that A is a contraction, which finishes the proof of 
the existence. i 


Remark. Let us summarize the proof of the existence of solutions as follows. Let ¢,6, L 
be the parameters from the the local Lipschitz condition at the point (to, 20), that is, 


Ift,a)— fyi < Lla-yl 
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for all t € [to — 6, to + 6] and 2, y € [zo — €, 2 +]. Let 


M =sup{|f (¢,2)| +t € [to —4,t0 +4], x € [to —€,%0 + €]}. 


Then the IVP has a solution on an interval [to — r, to + r] provided r is a positive number 
that satisfies the following conditions: 
1 


E 
< < — =. i 
ROP SS ey: Pe (2.6) 


For some applications, it is important that r can be determined as a function of ¢, 6, M, L. 


Example. The method of the proof of the existence in Theorem 2.2 suggests the following 
procedure of computation of the solution of IVP. We start with any function 7 € X (using 
the same notation as in the proof) and construct the sequence {x,}7-, of functions in 
X using the rule t,4; = Ax,. The sequence {2} is called the Picard iterations, and it 
converges uniformly to the solution x (t). 

Let us illustrate this method on the following example: 


eos 


Az (t) = 1+ [x (s)ds, 


whence, setting x (t) = 1, we obtain 


The operator A is given by 


t 
x()=1+ [ rods = 1+, 
J0 


t 2 
t 
m(Q=1+ f nds=14t45 
JO 
t #2 3 
s(t) =1+ | adti=1+t+ ata 
and by induction 
t? p t” 
ti (t)=l+t+ ata t+. 


Clearly, x, — e! as n — oo, and the function x (t) = e’ indeed solves the above IVP. 


For the proof of the uniqueness, we need the following two lemmas. 


Lemma 2.3 (The Gronwall inequality) Let z(t) be a non-negative continuous function 
on [to, ti] where to < t,. Assume that there are constants C,L > 0 such that 


HS O44 f 2(s)ds (2.7) 


J to 


for all t € [to, ti]. Then 
z(t) < Cexp (L(t — to)) (2.8) 


for all t € [to, ¢]. 
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Proof. We can assume that C is strictly positive. Indeed, if (2.7) holds with C = 0 
then it holds with any C > 0. Therefore, (2.8) holds with any C > 0, whence it follows 
that it holds with C = 0. Hence, assume in the sequel that C' > 0. This implies that the 
right hand side of (2.7) is positive. Set 


Fiyao+ f 2(s)ds 


J to 


and observe that F' is differentiable and F” = Lz. It follows from (2.7) that z < F whence 
Peat Le. 


This is a differential inequality for F’ that can be solved similarly to the separable ODE. 
Since F' > 0, dividing by F' we obtain 


whence by integration 


F (t) NBT (3) : 
” FU) =| Foes | Lds = L(t— to), 


for all t € [to, ti]. It follows that 
F(t) < F (to) exp (L(t — to)) = Cexp (L(t — to)). 
Using again (2.7), that is, z < F', we obtain (2.8). m 


Lemma 2.4 If S is a subset of an interval U C R that is both open (of fen) and closed 
(abgeschlossen) in U then either S is empty or S =U. 


Any set U that satisfies the conclusion of Lemma 2.4 is called connected (zusammenhangend). 
Hence, Lemma 2.4 says that any interval is a connected set. 

Proof. Set S° = U\S so that S° is closed in U. Assume that both S and S° are non- 
empty and choose some points ao € S, bp € S°. Set c= foto so that c € U and, hence, 
c belongs to S or S°. Out of the intervals [a,c], [c, bo] choose the one whose endpoints 
belong to different sets S,.S°and rename it by [a;, bi], say a1 € S and b; € S*°. Considering 
the point c = eh we repeat the same argument and construct an interval [a2, bz] being 
one of two halfs of [a1,b:] such that az € S and b € S*. Contintue further, we obtain 
a nested sequence {[ax, by] }7-.9 of intervals such that a, € S, by € S° and |by — ax| — 0. 
By the principle of nested intervals (Intervallschachtelungsprinzip), there is a common 
point x € [ag, by] for all k. Note that « € U. Since a, — x, we must have x € S, and 
since b, — x, we must have x € S°, because both sets S and S® are closed in U. This 
contradiction finishes the proof. m 

Proof of the uniqueness in Theorem 2.2. Assume that x, (t) and «2 (t) are two 
solutions of the same IVP both defined on an open interval U C R and prove that they 
coincide on U. 

We first prove that the two solution coincide in some interval around tg. Let ¢ and 
6 be the parameters from the Lipschitz condition at the point (to,x%o) as above. Choose 
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0 <r<0dsosmall that the both functions x, (t) and 22 (t) restricted to I = [to — r, to +r] 
take values in J = [xo — €, 29 + €] (which is possible because both x; (t) and «2 (t) are 
continuous functions). As in the proof of the existence, the both solutions satisfies the 
integral identity 


z(t) =u+ | f (s,x(s)) ds 


for all t € I. Hence, for the difference z(t) := |x, (t) — x2 (t)|, we have 


z(t) =| (t)— #2 (4)| < i If (s, 1 (8)) — f(s, #2 (s))] ds, 


assuming for certainty that to < t < to +r. Since the both points (s, x, (s)) and (s, x2 (s)) 
in the given range of s are contained in J x J, we obtain by the Lipschitz condition 


|f (s, 1 (8)) — f(s, #2 (8))| S Blas (8) — ate (s)| 
whence ; 
PAbA ES L| z(s) ds. 

J to 
Appling the Gronwall inequality with C = 0 we obtain z(t) < 0. Since z > 0, we 
conclude that z(t) = 0 for all ¢ € [to, to +r]. In the same way, one gets that z(t) = 0 for 
t € [to — r, to], which proves that the solutions x, (¢) and x2 (t) coincide on the interval J. 

Now we prove that they coincide on the full interval U. Consider the set 


S={tEU: 2, (t) =22(t)} 


and let us show that the set S is both closed and open in J. The closedness is obvious: if 
x1 (te) = L2 (t,) for a sequence {t,} and t, — t € U as k — ow then passing to the limit 
and using the continuity of the solutions, we obtain x, (t) = x(t), that is, t € S. 

Let us prove that the set S is open. Fix some t; € S. Since xj (t;) = %2 (t1), the both 
functions x, (t) and x2 (t) solve the same IVP with the initial condition at t;. By the 
above argument, 21 (¢) = x(t) in some interval J = [t; — r,t; +1] with r > 0. Hence, 
I CS, which implies that S is open. 

Since the set S is non-empty (it contains to) and is both open and closed in U, we 
conclude by Lemma 2.4 that S = U, which finishes the proof of uniqueness. m 


2.2 Dependence on the initial value 


Consider the IVP ; (i,2) 
x’ =f (t,x 
, 2: 
{ Clty) = 3s 22) 
where the initial value is denoted by s instead of 2% to emphasize that we allow now s to 
vary. Hence, the solution is can be considered as a function of two variables: x = x (t, s). 
Our aim is to investigate the dependence on s. 
As before, assume that f is continuous in an open set 2 C R? and is locally Lipschitz 
in this set in x. Fix a point (to, 70) € Q and let ¢,d, L be the parameters from the local 
Lipschitz condition at this point, that is, the rectangle 


R= [to — 6, to + 6] x [ao — €,%0 + €| 
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is contained in 2 and, for all (t,x), (t,y) € R, 


Let M be the supremum of | f (t,2)| in R. By the proof of Theorem 2.2, the solution x (t) 
with the initial condition x (to) = xo is defined in the interval [t) — r,t) +1] where r is 
any positive number that satisfies (2.6), and x(t) takes values in [xo — €,29 +6] for all 
t € [to — r,to +7]. Let us choose r as follows 


. e l 
r=min (5355) . (2.10) 


For what follows, it is only important that r can be determined as a function of ¢,6,L, M. 
Now consider the IVP with the condition x (tg) = s where s is close enough to Xo, say 


8 € [vo — €/2, 29 + €/2]. (2.11) 
Then the rectangle 
R' = [to — 6, to + 6] x [s — e/2,8 + 2/2] 


is contained in R. Therefore, the Lipschitz condition holds in R’ also with constant L and 
supp |f| <M. Hence, the solution x (t, s) with the initial condition x (to) = s is defined 
in [to — r(s) ,tp +r (s)] and takes values in [s — ¢/2,s + ¢/2] C [xp — €, 29 +] provided 


: ee 
r(s) < min (6 a 5) (2,12) 


(in comparison with (2.10), here ¢ is replaced by ¢/2 in accordance with the definition of 
R’). Clearly, if r satisfies (2.10) then the value 


satisfies (2.12). Let us state the result of this argument as follows. 


Claim. Fix a point (to, %) € Q and choose ¢,5 > 0 from the local Lipschitz condition at 
(to, 20). Let L be the Lipschitz constant in R = [to — 6,to +6] x [to —€, 40 +e], M = 

sup |f|, and define r =r (€,6, L, M) by (2.10). Then, for any s € [a9 — €/2, x0 + €/2], the 
R 


solution x (t,s) of (2.9) is defined in [to — r/2,to + r/2] and takes values in [a9 — €, Uo + €]. 


In particular, we can compare solutions with different initial value s since they have 
the common domain [to — r/2,t) + 1/2] (see the diagram below). 
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i i i ' 
to for to-r/2, ty) totr/2) so totr tot 8 t 


Theorem 2.5 (Continuous dependence on the initial value) Let Q be an open set in 
R? and f (t,x) be a continuous function in Q that is locally Lipschitz in x. Let (to, 20) 
be a point in Q and let c,r be as above. Then, for all s',s” € [xp — €/2,%9 + €/2] and 
t € [to — 1/2, to + r/2], 

|x (t, s’) — x(t, 8”)| < 2|s’- 8”. (2.13) 


Consequently, the function x (t,s) is continuous in (t, s). 


Proof. Consider again the integral equations 
ot 
z(t,s’)=s'+ i 7 (73a0,8 \\er 
J to 


and 


ot 
x(t,s") = 8" + | PRs ds. 
J to 


It follows that, for all t € [to, tp + r/2], 


Ie (ts!) —a(t.s")| < |s'—s"|4 / If (r,0 (7,8!) —f alr s"))| dr 
< |s'—s"|4 [ Lies) —a(r,s")|dr, 


where we have used the Lipschitz condition because by the above Claim (7, x (7, s)) € 
[to — 6, to + 6] x [29 — €, 20 + ¢] for all s € [aq — €/2, 29 + €/2]. 
Setting z(t) = |x (t, s’) — x (t, s”)| we obtain 


sy 2 = SEE | Had 


J to 
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which implies by the Lemma 2.3 
z(t) < |s’ — s"|exp(L(t—to)). 
Since t — to < r/2 and by (2.10) L < + we see that L(t — to) < } and 
exp (L(t — to)) < el/* < 2, 


which proves (2.13) for t > to. Similarly one obtains the same for t < to. 
Let us prove that x (t,s) is continuous in (t,s). Fix a point (t,s) € Q and prove that 
x (t, 8) is continuous at this point, that is, 


£ (tn, $n) > « (t, 8) 
if (tn, 5n) — (t, 8) as n > oo. Then by (2.13) 


|Z (tn, Sn) — x(t, s)| (hes Sn) HE is) EE la BP SS) 


< 
S285) Fe eS) — BS) 


and this goes to 0 as n — oo by the continuity of x(t, s) in t for a fixed s. m 


Remark. The same argument shows that if a function x (t, s) is continuous in t for any 
fixed s and uniformly continuous in s, then x (t,s) is jointly continuous in (t, s). 


2.3. Higher order ODE and reduction to the first order system 


A general ODE of the order n resolved with respect to the highest derivative can be 
written in the form 
yO =F (t, y, dae : (2.14) 


where t is an independent variable and y (¢) is an unknown function. It is sometimes more 
convenient to replace this equation by a system of ODEs of the 1° order. 

Let x (t) be a vector function of a real variable t, which takes values in R”. Denote by 
xz the components of x. Then the derivative x’ (t) is defined component-wise by 


Tmt pay eh la 
Consider now a vector ODE of the first order 
a af (32) (2:15) 


where f is a given function of n+1 variables, which takes values in R”, that is, f :Q— R” 
where Q is an open subset of R"*! (so that the couple (t,) is considered as a point in 
Q). Denoting by f, the components of f, we can rewrite the vector equation (2.15) as a 
system of n scalar equations 


B= TG Aiea) 
Fe cca eee (2.16) 
Le, = Sn eae 5D) 
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A system of ODEs of the form (2.15) is called the normal system. 
Let us show how the equation (2.14) can be reduced to the normal system (2.16). 
Indeed, with any function y (t) let us associate the vector-function 


i (y,y', ey yD) ) 
which takes values in R”. That is, we have 
My=Y;, %2> y’, seey In = gare): 


Obviously, 
i oa (y/, Vike wna) ’ 


and using (2.14) we obtain a system of equations 


Ly = %2 
Th = 23 
(2.17) 
Tp—-1 — Un 
Gl yeti) 
Obviously, we can rewrite this system as a vector equation (2.15) where 
Pb) Seay ea ON ii) (2.18) 


Conversely, the system (2.17) implies 
a”) =z =F (ene, ser) 


so that we obtain equation (2.14) with respect to y = x;. Hence, the equation (2.14) is 
equivalent to the vector equation (2.15) with function f defined by (2.18). 


Example. For example, consider the second order equation 
ye =F (d, Y, y') . 


Setting « = (y, y’) we obtain 


whence 


can = 7% 
cs =f (t, £1, La) 


Hence, we obtain the normal system (2.15) with 


f (t, 2) = (ao, F(t, 21, £2)). 


What initial value problem is associated with the vector equation (2.15) and the scalar 
higher order equation (2.14)? Motivated by the study of the Ist order ODE, one can 
presume that it makes sense to consider the following IVP for the vector 1st order ODE 


ores 


xv (to) = Xo 
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where 29 € R” is a given initial value of x (t). For the equation (2.14), this means that the 
initial conditions should prescribe the value of the vector x = (y, i sis: yr) at some to, 
which amounts to n scalar conditions 


y (to) = Yo 
y' (to) = 1 


yo) (to) = Yn-1 


where Yo, ..-,Yn—1 are given values. Hence, the initial value problem IVP for the scalar 
equation of the order n can be stated as follows: 


PoP (Ey ye!) 
y (to) = Yo 
y' (to) =%1 


gor) (to) = Yn-1- 
2.4 Norms in R"” 
Recall that a norm in R” is a function N : R” — R with the following properties: 


1. N (a) > 0 for all x € R” and N (x) = 0 if and only if « = 0. 
2. N (cx) = |c| N (x) for all x € R” and cE R. 


3. N(a+y) < N(z)4+N (y) for all x,y € R”. 


For example, the function || is a norm in R. Usually one uses the notation |||] for a 
norm instead of N (x). 


Example. For any p > 1, the p-norm in R” is defined by 


an 1/p 
ely = (Soh | 
k=1 


In particular, for p = 1 we have 
lel = >— lea, 
k=1 


and for p = 2 


For p = o0 set 
elles =e [75 | 
1<k<n 


It is known that the p-norm for any p € [1,00] is indeed a norm. 


It follows from the definition of a norm that in R any norm has the form |||] = c |z| 
where c is a positive constant. In R”, n > 2, there is a great variety of non-proportional 
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norms. However, it is known that all possible norms in R” are equivalent in the following 
sense: if Nj (x) and Np (x) are two norms in R” then there are positive constants C” and 
C” such that 


uN Ny (x) 
She 


For example, it follows from the definitions of ||x||, and ||<||,, that 


<C" for all 40. (2.19) 


re leh S, 
ls 


For most applications, the relation (2.19) means that the choice of a specific norm is not 
important. 

The notion of a norm is used in order to define the Lipschitz condition for functions 
in R”. Let us fix some norm ||z|| in R”. For any z € R" and r > 0, and define a closed 
ball B(a,r) by 

B(a,r) ={y eR": |lz-yl| <r}. 
For example, in R with ||z|| = |z| we have B (z,r) = [x — r,2 +r]. Similarly, one defines 
an open ball 

B(x,r) = {ye R”: |lx— yl] <r}. 


Below are sketches of the ball B (0,1) in R? for different norms: 
the 1-norm: 


the 2-norm (a round ball): 


the 4-norm: 
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the oo-norm (a box): 


2.5 Existence and uniqueness for a system of ODEs 


Let 2 be an open subset of R"*! and f = f (t,x) be a mapping from 2 to R”. Fix a norm 
||z|| in R”. 


Definition. Function f (t,) is called Lipschitz in x in Q if there is a constant L such 
that for all (t,x), (t,y) € Q 


If (tc) — fyi < Lila — yl. (2.20) 


In the view of the equivalence of any two norms in R”, the property to be Lipschitz 
does not depend on the choice of the norm (but the value of the Lipschitz constant L 
does). 

A subset K of R"*! will be called a cylinder if it has the form K = I x B where I is 
an interval in R and B is a ball (open or closed) in R”. The cylinder is closed if both I 
and B are closed, and open if both J and B are open. 


Definition. Function f (t,x) is called locally Lipschitz in x in 2 if for any (to, zo) € 2 
there exist constants ¢,d > 0 such that the cylinder 


K= [to — 6, to + 0] x B (20, €) 


is contained in 2 and f is Lipschitz in x in K. 
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Lemma 2.6 (a) [f all components f, of f are differentiable functions in a cylinder K 
and all the partial derivatives oli £ are bounded in K then the function f (t,x) is Lipschitz 
inzink. 

(b) If all partial derivatives 5 exists and are continuous in Q then f (t,x) is locally 
Lipschitz in x in Q. 


Proof. Let us use the following mean value property of functions in R”: if g is a 
differentiable real valued function in a ball B C R” then, for all x,y € B there is € € [z, y| 
such that 

g(y)—9 (2) =) 5 © (ws — 25) (2.21) 
- a] 


(note that the interval [x, y] is contained in the ball B so that a (€) makes sense). Indeed, 
consider the function 
h(t)=g(a+t(y—x)) where t € [0,1]. 


The function h (t) is differentiable on [0, 1] and, by the mean value theorem in R, there is 
€ (0,1) such that 


and setting € = «+7(y— <2), we obtain (2.21). 

(a) Let K = Ix B where J is an interval in R and B isa ballin R”. If (t,x) , (t,y) e K 
then t € J and x,y € B. Applying the above mean value property for the k-th component 
fe of f, we obtain that 


ee 


Sa (t,0) — fe (t,y) = a §) (tj — yi), (2.22) 


where € is a point in the interval [x,y] C B. Set 


Of 
Ox; 


C= ee Ne 


and note that by the hypothesis C' < oo. Hence, by (2.22) 


Ife (t,2) — fic (ty) SCD? ary — yl = Cll — yl. 


j=l 


Taking max in k, we obtain 


If (2) — f (ty) loo S Ella — ylfa. 


Switching in the both sides to the given norm ||-|| and using the equivalence of all norms, 
we obtain that f is Lipschitz in x in K. 
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(b) Given a point (to, 20) € 2, choose positive ¢ and 6 so that the cylinder 
K= [to == 6, to + 6] x B (20, €) 


is contained in 9, which is possible by the openness of 2. Since the components fy, 
are continuously differentiable, they are differentiable. Since K is a closed bounded set 


and the partial derivatives oh are continuous, they are bounded on Kk. By part (a) we 


conclude that f is Lipschitz inz in K , which finishes the proof. m 


Definition. Given a function f : Q — R”, where 2 is an open set in R"*!, consider the 


oe ee) 
T= T(r) 
{ Was, (2.23) 
where (to, Zo) is a given point in Q. A function x (t) : J — R” is called a solution (2.23) 
if the domain J is an open interval in R containing to, x(t) is differentiable in ¢ in J, 
(t,x (t)) € Q for all t € J, and x(t) satisfies the ODE a’ = f (t,x) in J and the initial 
condition x (to) = 2p. 
The graph of function x(t), that is, the set of points (t,« (t)), is hence a curve in 
Q that goes through the point (to,xo). It is also called the integral curve of the ODE 
se a 9 


Theorem 2.7 (Picard - Lindeléf Theorem) Consider the equation 
eS f(a) 


where f : Q — R” is a mapping from an open set Q C R"*! to R". Assume that f is 
continuous on Q and locally Lipschitz in x. Then, for any point (to,2%o) € Q, the initial 
value problem IVP (2.23) has a solution. 

Furthermore, if x(t) and y(t) are two solutions to the same IVP then x (t) = y(t) in 
their common domain. 


Proof. The proof is very similar to the case n = 1 considered in Theorem 2.2. We 
start with the following claim. 


Claim. A function x(t) solves IVP if and only if x(t) is a continuous function on an 
open interval I such that to € I, (t,x (t)) € Q for allt € I, and 


x(t) =a+ i f (s, x (s)) ds. (2.24) 


Here the integral of the vector valued function is understood component-wise. If x 
solves IVP then (2.24) follows from x), = fi, (t,x (t)) just by integration: 


[a (s)ds = [ fe (s,x(s)) ds 


whence 


on(t)— (eo), = f fals.2(s)) ds 
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and (2.24) follows. Conversely, if x is a continuous function that satisfies (2.24) then 


Lp = (Xo), + / fr (s, a (s)) ds. 


The right hand side here is differentiable in t whence it follows that x; (t) is differentiable. 
It is trivial that x, (to) = (xo),, and after differentiation we obtain 7, = fi, (t,x) and, 
hence, 2 = f (¢, 2); 

Fix a point (to, 70) € and let ¢,6 be the parameter from the the local Lipschitz 
condition at this point, that is, there is a constant L such that 


If (t2)- Few s Lie — ll 


for all t € [to — 6, to + 6] and x,y € B(xo,¢). Choose some r € (0, 6] to be specified later 
on, and set ae 
I= [to—r,tot+r] and J = B(a0,¢€). 


Denote by X the family of all continuous functions x (t) : [ — J, that is, 
X ={x:I— J: <x is continuous}. 


Consider the integral operator A defined on functions x (t) by 


Az (t) =a + / f (s,x(s)) ds. 


We would like to ensure that « € X implies Ax € X. Note that, for any x € X, the 
point (s,x(s)) belongs to 2 so that the above integral makes sense and the function Az is 
defined on I. This function is obviously continuous. We are left to verify that the image 
of Ax is contained in J. Indeed, the latter condition means that 


|| Ax (t) — xo|| < € for all t € J. (2.25) 


We have, for any t € J, 


t 
Jac) aol = [f s62(spas 
Jt 
Pia 
< | | f(s, (s))|| ds (see Exercise 15) 
J to 
< sup |[f(s,#)|||t— tol < Mr, 
s€l,xeJ 
where 
M= sup [If (s,2)I| <0. 
s€[to—6,to+d] 
x€B(a0,€). 


Hence, if r is so small that Mr < ¢ then (2.5) is satisfied and, hence, Ax € X. 
Define a distance function on the function family X as follows: for all x,y € X, 


d(x,y) = sup Iz) —y @II. 
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Then (X,d) is a complete metric space (see Exercise 16). 

We are left to ensure that the mapping A: X — X is a contraction. For any two 
functions x,y € X and any t € I, t > to, we have x (t) , y(t) € J whence by the Lipschitz 
condition 


Av (t) — Ay(t)|| = | 


S _ UF 2 (8) — Fls.u(s))II ds 
< , Llle(s) —y(s)llds 

= EE) UNS) 

< LIrd(z,y). 


The same inequality holds for t < tg. Taking sup in t € J, we obtain 
d(Az, Ay) < Lrd(a,y). 


Hence, choosing r < 1/Z, we obtain that A is a contraction. By the Banach fixed point 
theorem, we conclude that the equation Ax = x has a solution x € X, which hence solves 
the IVP. 

Assume that x(t) and y(t) are two solutions of the same IVP both defined on an 
open interval U C R and prove that they coincide on U. We first prove that the two 
solution coincide in some interval around tg. Let ¢ and 6 be the parameters from the 
Lipschitz condition at the point (to,29) as above. Choose 0 < r < 6 so small that the 
both functions x (t) and y (t) restricted to I = [ty — r, to +r] take values in J = B (xo, €) 
(which is possible because both x (t) and y (t) are continuous functions). As in the proof 
of the existence, the both solutions satisfies the integral identity 


x (t) = 09+ | f (s, x (s)) ds 


for all t € J. Hence, for the difference z (t) := ||x (t) — y (#)||, we have 


z(t)=|e®-y@lls II f(s, © (s)) — f(s, y (s)) [I ds, 


assuming for certainty that to <t <t) +r. Since the both points (s,x(s)) and (s, y (s)) 
in the given range of s are contained in J x J, we obtain by the Lipschitz condition 


If (s, 2 (s)) — F(s,y(s))Il S LIla (s) — y(s)ll 
whence : 
z(t) <i | z(s) ds. 
J to 
Appling the Gronwall inequality with C = 0 we obtain z(t) < 0. Since z > 0, we 


conclude that z(t) = 0 for all ¢ € [to, to +r]. In the same way, one gets that z(t) = 0 for 
t € [to — r, to], which proves that the solutions x (t) and y (t) coincide on the interval J. 
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Now we prove that they coincide on the full interval U. Consider the set 
S={teUu:z(t)=y(t)} 


and let us show that the set S is both closed and open in J. The closedness is obvious: 
if x(t.) = y (t,) for a sequence {t,} and t, — t € U as k — oo then passing to the limit 
and using the continuity of the solutions, we obtain x (t) = y(t), that is, t € S. 

Let us prove that the set S is open. Fix some t; € S. Since x (ti) = y(ti) =: %1, 
the both functions x(t) and y(t) solve the same IVP with the initial data (t;,2). By 
the above argument, x(t) = y(t) in some interval J = [t; — r,t; +r] with r > 0. Hence, 
I CS, which implies that S is open. 

Since the set S is non-empty (it contains to) and is both open and closed in U, we 
conclude by Lemma 2.4 that S = U, which finishes the proof of uniqueness. m 


Remark. Let us summarize the proof of the existence part of Theorem 2.7 as follows. 
For any point (to, 7) € 2, we first choose positive constants ¢,6,L from the Lipschitz 
condition, that is, the cylinder 


G= [to = 6, to + 6] x B (2, €) 
is contained in 2 and, for any two points (t,x) and (t, y) from G with the same t, 


If (te) - FEW < Lila — yl. 


Let 
M= sup [lf @, r)|| 


and choose any positive r to satisfy 


E 1 
= < — =, 2.2 
EOS TS a TS (2.26) 
Then there exists a solution x (t) to the IVP, which is defined on the interval [to — 1, to +7] 
and takes values in B (1p, €). 
The fact that the domain of the solution admits the explicit estimates (2.26) can be 
used as follows. 


Corollary. Under the conditions of Theorem 2.7 for any point (to,xo) € Q there are 
positive constants e« andr such that, for any ty € [to — r/2,to + 17/2] and x € B (xo, €/2), 


the IVP fn Hcg 
gi = Es 
{ ee (2.27) 


has a solution x(t) which is defined for all t € [to —1r/2,to + 1/2] and takes values in 


B (20, €). 
Proof. Let ¢,6,L,M be as in the proof of Theorem 2.7. Assuming that t, € 
[to — 6/2, to + 5/2] and x, € B(x, ¢/2), we obtain that the cylinder 


G, = [tr — 6/2,t, + 6/2] x B(x, ¢/2) 
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is contained in G. Hence, the values of LZ and M for the cylinder G; can be taken the same 
as those for G. Therefore, the IVP (2.27) has solution x (t) in the interval [t; — r,t, +r], 
and x (t) takes values in B (x1,¢/2) C B(a,<€) provided 

€ 1 


aM” ar 72 


r =min oe : 
2'2M”’ 21 }° 
If t) € [to —1r/2,to +r/2] then [tp — r/2,t9 + r/2] C [ti —1r,t1 +17] so that the solution 


x(t) of (2.27) is defined on [ty — r/2, to + 7/2] and takes value in B (x,¢), which was to 
be proved. m 


Pela rx 


For example, take 


2.6 Maximal solutions 


Consider again the ODE 

v= f(t, 2) 
where f : Q — R” is a mapping from an open set Q C R"*! to R”, which is continuous 
on 2 and locally Lipschitz in x. 

Although the uniqueness part of Theorem 2.7 says that any two solutions are the same 
in their common interval, still there are many different solutions to the same IVP because 
strictly speaking, the functions that are defined on different domains are different, despite 
they coincide in the intersection of the domains. The purpose of what follows is to define 
the maximal possible domain where the solution to the IVP exists. 

We say that a solution y (t) of the ODE is an extension of a solution x (t) if the domain 
of y (t) contains the domain of x (t) and the solutions coincide in the common domain. 


Definition. A solution x(t) of the ODE is called maximal if it is defined on an open 
interval and cannot be extended to any larger open interval. 


Theorem 2.8 Assume that the conditions of Theorem 2.7 are satisfied. Then the follow- 
ing is true. 

(a) Any IVP has is a unique maximal solution. 

(b) If x(t) and y(t) are two maximal solutions to the same ODE and x (t) = y (t) for 
some value of t, then x and y are identically equal, including the identity of their domains. 

(c) If x(t) is a maximal solution with the domain (a,b) then x (t) leaves any compact 
set K CQ ast— a andast — b. 


Here the phrase “zx (t) leaves any compact set Kv as t — b” means the follows: there is 
T € (a,b) such that for any t € (7, b), the point (t, x (t)) does not belong to kK’. Similarly, 
the phrase “z (t) leaves any compact set KK as t — a” means that there is T € (a,b) such 
that for any t € (a,T), the point (t,x (t)) does not belong to Kk. 
Example. 1. Consider the ODE vz’ = x? in the domain Q = R?. This is separable 
equation and can be solved as follows. Obviously, 7 = 0 is a constant solution. In the 
domains where x 4 0 we have 

x'dt 
[Zefa 
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whence 


and x(t) = —-4+, (where we have replaced C by —C). Hence, the family of all solutions 
consists of a straight line x (t) = 0 and hyperbolas z (t) = 4 


oz with the maximal domains 
(C, +00) and (—oo, C) (see the diagram below). 


y 50 


-25] 


-50 


Each of these solutions leaves any compact set K, but in different ways: the solutions 
a(t) = 0 leaves K as t — too because K is bounded, while x(t) = 4 leaves K as 
t — C because z(t) — too. 


2. Consider the ODE 2’ = + in the domain 2 = R x (0,+00) (that is, ¢ € R and 
x > 0). By the separation of variables, we obtain 


re 
Sa fade = foa'it= [a=tre 


ea 2(b— OC), tC. 


whence 


See the diagram below: 


Al 


Obviously, the maximal domain of the solution is (C,+00). The solution leaves any 
compact kK C Q as t — C because (t, x (t)) tends to the point (C,0) at the boundary of 
Q. 

The proof of Theorem 2.8 will be preceded by a lemma. 


Lemma 2.9 Let {2a (t)}oe4 
index set, and let the domain of x, be an open interval I,. Set I =\J 
function x(t) on I as follows: 


be a family of solutions to the same IVP where A is any 
I, and define a 


acA 


elt) = tall) Gre das (2.28) 
Then I is an open interval and x (t) is a solution to the same IVP on I. 


The function « (t) defined by (2.28) is referred to as the union of the family {xq (t)}. 

Proof. First of all, let us verify that the identity (2.28) defines x(t) correctly, that 
is, the right hand side does not depend on the choice of a. Indeed, if also t € J, then t 
belongs to the intersection [, [3 and by the uniqueness theorem, x, (t) = xg (t). Hence, 
the value of x (t) is independent of the choice of the index a. Note that the graph of z (t) 
is the union of the graphs of all functions x, (t). 

Set a = inf J, b = sup/ and show that J = (a,b). Let us first verify that (a,b) C J, 
that is, any t € (a,b) belongs also to J. Assume for certainty that t > to. Since b = sup J, 
there is t; € J such that t < t; < 6. There exists an index a such that t; € J ,. Since 
also to € I,, the entire interval [to,¢,] is contained in J,. Since t € [to, ti], we conclude 
that t © I, and, hence, t € I. 

It follows that J is an interval with the endpoints a and 6. Since J is the union of open 
intervals, J is an open subset of R, whence it follows that J is an open interval, that is, 
i = (0,0), 

Finally, let us verify why x(t) solves the given IVP. We have x (to) = xo because 
to € I, for any a and 

v (to) = La (to) = Lo 
so that x (t) satisfies the initial condition. Why x (t) satisfies the ODE at any t € J? Any 
given t € I belongs to some J,. Since %, solves the ODE in J, and x = %q on Ig, we 
conclude that x satisfies the ODE at t, which finishes the proof. m 


A2 


Proof of Theorem 2.8. (a) Consider the IVP 


{ ee hee (2.29) 


x (to) = Xo 


and let S be the set of all possible solutions to this IVP defined on open intervals. Let 
x(t) be the union of all solutions from S. By Lemma 2.9, the function x(t) is also a 
solution to the IVP and, hence, x(t) € S. Moreover, x (t) is a maximal solution because 
the domain of x (t) contains the domains of all other solutions from S and, hence, «x (t) 
cannot be extended to a larger open interval. This proves the existence of a maximal 
solution. 

Let y(t) be another maximal solution to the IVP and let z(t) be the union of the 
solutions x(t) and y(t). By Lemma 2.9, z(t) solves the IVP and extends both x (t) and 
y (t), which implies by the maximality of 7 and y that z is identical to both x and y. 
Hence, x and y are identical (including the identity of the domains), which proves the 
uniqueness of a maximal solution. 

(b) Let x(t) and y(t) be two maximal solutions that coincide at some t, say t = fy. 
Set 7; = x(t1) = y(t1). Then both z and y are solutions to the same IVP with the initial 
point (t1, 71) and, hence, they coincide by part (a). 

(c) Let x(t) be a maximal solution defined on (a,b) where a < b, and assume that 
x(t) does not leave a compact AK C 92 as t — a. Then there is a sequence t, — a such 
that (t,, 7.) € K where x, = x(t,). By a property of compact sets, any sequence in 
has a convergent subsequence whose limit is in kK. Hence, passing to a subsequence, we 
can assume that the sequence {(t,, 7) }7., converges to a point (to, 29) € K as k > ov. 
Clearly, we have to = a, which in particular implies that a is finite. 

By Corollary to Theorem 2.7, for the point (to, zo), there exist r,¢ > 0 such that the 
IVP with the initial point inside the cylinder 


G = [to — r/2,to + r/2] x B (x0, €/2) 


has a solution defined for all t € [to — r/2,to + r/2]. In particular, if k is large enough 
then (tz, 7%) € G, which implies that the solution y(t) to the following IVP 


eon 


y (te) = @r, 


is defined for all t € [to — r/2, to + 7/2] (see the diagram below). 
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B(xo,e/2) 


[to-r/2, to+r/2] t 


Since x (t) also solves this IVP, the union z (t) of x(t) and y(t) solves the same IVP. 
Note that x(t) is defined only for t > to while z(t) is defined also for t € [to — r/2, to]. 
Hence, the solution x(t) can be extended to a larger interval, which contradicts the 
maximality of x(t). ™ 


Remark. By definition, a maximal solution x(t) is defined on an open interval, say 
(a,b), and it cannot be extended to a larger open interval. One may wonder if x (t) can 
be extended at least to the endpoints t = a or t = 0. It turns out that this is never the 
case (unless the domain 2 of the function f (t,x) can be enlarged). Indeed, if x (t) can 
be defined as a solution to the ODE also for t = a then (a, (a)) € Q and, hence, there is 
ball B in R"*? centered at the point (a,x (a)) such that BC Q. By shrinking the radius 
of B, we can assume that the corresponding closed ball B is also contained in Q. Since 
x(t) + x(a) ast — a, we obtain that (t,x (t)) € B for all t close enough to a. Therefore, 
the solution x (t) does not leave the compact set B C Q as t — a, which contradicts part 
(c) of Theorem 2.8. 


2.7 Continuity of solutions with respect to f (t,x) 


Consider the IVP Laated 
eS (ye 
{ iia (2.30) 
In Section 2.2, we have investigated in the one dimensional case the dependence of the 
solution x(t) upon the initial value 7). A more general question, which will be treated 
here, is how the solution x (t) depends on the right hand side f (t, 7). The dependence on 
the initial condition can be reduced to the dependence of the right hand side as follows. 
Consider the function y (t) = x(t) — xo, which obviously solves the [VP 


nai (2.31) 


4A 


Hence, if we know that the solution y (t) of (2.31) depends continuously on the right hand 
side, then it will follow that y(t) is continuous in xo, which implies that also the solution 
x (t) of (2.30) is continuous in 2. 

Let be an open set in R”*! and f,g be two functions from 2 to R". Assume in 
what follows that both f,g are continuous and locally Lipschitz in x in 2, and consider 
two initial value problems 

/ 
{ Bde (2.32) 


4 0 (to) = Xo 

and ; 
=g(t 

y= 9 (ty) (2.33) 

y (to) = Lo 
where (to, Zo) is a fixed point in 2. 

Assume that the function f as fixed and x (t) is a fixed solution of (2.32). The function 

g will be treated as variable.. Our purpose is to show that if g is chosen close enough 
to f then the solution y (t) of (2.33) is close enough to x(t). Apart from the theoretical 
interest, this question has significant practical consequences. For example, if one knows 
the function f (t,2) only approximately (which is always the case in applications in Sci- 
ences and Engineering) then solving (2.32) approximately means solving another problem 
(2.33) where g is an approximation to f. Hence, it is important to know that the solution 
y (t) of (2.33) is actually an approximation of « (t). 


Theorem 2.10 Let x(t) be a solution to the IVP (2.32) defined on an interval (a,b). 
Then, for all real a, such thata <a< to < 8 < b and for any < > 0, there is n > 0 
such that, for any function g :Q— R” such that 


sup lf -—gll <n, (2.34) 


there is a solution y(t) of the IVP (2.33) defined in [a, 6], and this solution satisfies the 
inequality 
sup ||" (t) — y (f)|| Se. 
[2,8] 
Proof. For any ¢ > 0, consider the set 


K, = {(t,2) €R™* :a<t< 8B, ||le-2(|| <e} (2.35) 


which can be regarded as the ¢-neighborhood in R"*! of the graph of the function t + z (t) 
where ¢t € [a,@]. In particular, Ko is the graph of the function x(t) on [a, (] (see the 
diagram below). 
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The set Ko is compact because it is the image of the compact interval [a, 3] under the 
continuous mapping t +> (t,(t)). Hence, Ko is bounded and closed, which implies that 
also K. for any ¢ > 0 is also bounded and closed. Thus, K; is a compact subset of R"+! 
for any € > 0. 

Claim 1. There is ¢ > 0 such that Kz CQ and f is Lipschitz in x in Kg. 

Indeed, by the local Lipschitz condition, for any point (t,, 7.) € Q (in particular, for 

any (tx, 2) € Ko), there are constants ¢,d > 0 such that the cylinder 


G = [t, —6,t. + 6] x B(a.,€) 


is contained in Q and f is Lipschitz in G (see the diagram below). 


Varying the point (t,,2,) in Ko, we obtain a cover of Ko by the family of the open 
cylinders H = (t, —0,t. +6) x B(a,,¢/2) where ¢,d depend on (t,,2,). Since Ko is 
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compact, there is a finite subcover, that is, a finite number of points {(t;,x7;)};", on Ko 
and the corresponding numbers ¢;, 6; > 0, such that the cylinders 


cover all Ko. Set 7 
Gi; = [t; — 6;,t; + ;] x B (aj, €;) 


and let L; be the Lipschitz constant of f in G;, which exists by the choice of ¢;,6;. Set 


1 
€=- min ¢,and L= max L; (2.36) 


2 1<i<m 1<i<m 


and prove that K, C 2 and that function f is Lipschitz in kK. with the constant L. For 
any point (t,2) € K., we have by the definition of K- that t € [a, 6], (t,x (t)) € Ko and 


lz —a(t)|| <e. 
The point (t, x (t)) belongs to one of the cylinders H; so that 
te (t; — 6;,t; + 6:) and \|x (£) — a;|| <6; /2 


(see the diagram below). 


B(x; ;) 


B(x-€/2) 


By the triangle inequality, we have 
||z — al < lla -— 2 @)|| + la) —ai||<e+e/2< &, 


where we have used that by (2.36) ¢ < ¢;/2. Therefore, x € B(a;,¢;) whence it follows 
that (t,7) € G; and, hence, (t,7) € Q. Hence, we have shown that any point from K, 
belongs to 2, which proves that K, C2. 


AT 


If (t,x) ,(t,,y) € K- then by the above argument the both points x,y belong to the 
same ball B (x;, €;) that is determined by the condition (t, x (t)) € H;. Then (t,x), (t,,y) € 
G; and, since f is Lipschitz in G; with the constant D;, we obtain 


If (2)- Fy) < Lille —yll < Lila— yl, 


where we have used the definition (2.36) of L. This shows that f is Lipschitz in x in K,; 
and finishes the proof of Claim 1. 

Observe that if the statement of Claim 1 holds for some value of ¢ then it holds for 
all smaller values of ¢ as well, with the same L. Hence, we can assume that the value of 
€ from Theorem 2.10 is small enough so that it satisfies Claim 1. 

Let now y (t) be the maximal solution to the IVP (2.33), and let (a’, b’) be its domain. 
By Theorem 2.8, the graph of y(t) leaves K- when t — a’ and when t — 0’. Let (a’, 6’) 
be the maximal interval such that the graph of y(t) on this interval is contained in K;; 
that is, 

a’ = inf {t € (a, 8) N(a’, 0’) : (s,y(s)) € Ke for all s € [E, to]} (2.37) 


and (' is defined similarly with inf replaced by sup (see the diagrams below for the cases 
a’ > a and a’ = a, respectively). 
XA xXA 


: 7 ; - > 7 i — > 
a Qa t s ft Bp OB t a=a to B B 


In particular, (a’, 3’) is contained in (a’, b')M (a, 8), function y (¢t) is defined on (a’, 6’) 
and 


(t, y (t)) € Kz for all t € (a’, 8’). (2.38) 


Claim 2. We have [a’, B'| C (a’,b’); in particular, y (t) is defined on the closed interval 
[a’, 3']. Moreover, the following is true: either a! = a or 


a >a and |x (t)—y(t)|| =e fort =a’. (2.39) 
Similarly, either B’ = B or 
B'<B and |lx(t)— y(t) =e fort =". 
By Theorem 2.8, y(t) leaves K. as t — a’. Hence, for all values of ¢ close enough to 


a’ we have (t, y (t)) ¢ K-. For any such t we have by (2.37) t < a’ whence a’ < t < a and 
a’ <a’. Similarly, one shows that b/ > 3", whence the inclusion [a’, 3’] C [a’, b'] follows. 
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To prove the second part, assume that a’ 4 a that is, a’ > a, and prove that 
lz (t) — y (8)|| =e for t =a’. 


The condition a’ > a together with a’ > a’ implies that a’ belongs to the open interval 
(a, B) O(a’, U’). It follows that, for 7 > 0 small enough, 


(a’ —7T,a’+7)C (a, B)N(a,0). (2.40) 


For any t € (a’, 3’), we have 

Iz (@) —y@|| Se. 
By the continuity, this inequality extends also to t = a’. We need to prove that, for 
t =a’, equality is attained here. Indeed, if 


IIx (t) — y (t)|| < e for t= a 


then, by the continuity of x(t) and y(t), that the same inequality holds for all t € 
(a! —7T,a' +7) provided tT > 0 is small enough. Choosing 7 to satisfy also (2.40), we 
obtain that (t, y (t)) € K. for all t € (a! — r,a’], which contradicts the definition of a’. 


Claim 3. For any given a, ,¢ as above, there exists n > 0 such that if 


sup lf -—gll <n, (2.41) 


then [a’, 6'] = [a, 8]. 

In fact, Claim 3 will finish the proof of Theorem 2.10. Indeed, Claims 2 and 3 im- 
ply that y(t) is defined on [a, 8]; by the definition of a’ and 3’ (see (2.38)), we obtain 
(t, y (t)) € K- for all t € (a, 8), and by continuity, the same holds for t € [a, 6]. By the 
definition (2.35) of K., this means 


lly) — x @)|| <¢ for all t € [a, 6], 


which was the claim of Theorem 2.10. 
To prove Claim 3, for any t € [a’, 3’] write the integral identities 


(t)=a0+ | f(s.2(s))ds 


and 
y (t) =2o+ : g(s,y(s)) ds. 


Assuming for simplicity that t > to and using the triangle inequality, we obtain 


t 
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ae the points (s,x(s)) and (s,y(s)) are in K., we obtain by the Lipschitz condition in 
kK, (Claim 1) that 


le@) —y@I SZ five es) —9(s)||ds +supllf al) (B—a). 2.42) 


Hence, by the Gronwall lemma applied to the function z (t) = ||x (t) — y (t)|I, 
IBY Ol S (b= 0) exp Lt to) sup |g 
= (ee) ope o)sun lt a; (2.43) 
In the same way, (2.43) holds for t < to so that it is true for all t € [a’, 6’). 
Now choose 77 in (2.41) as follows 


__€ 4 L(8-a) . 


— -2(8—a) 
Then it follows from (2.43) that 
I|x (t) — y (t)|| < e/2 <« for all t € [a’, 6’). (2.44) 


By Claim 2, we conclude that a! = a and ~’ = 3, which finishes the proof. m 

Using the proof of Theorem 2.10, we can refine the statement of Theorem 2.10 as 
follows. 
Corollary Under the hypotheses of Theorem 2.10, let x (t) be a solution to the IVP (2.32) 
defined on an interval (a,b), and let a,8 be such thata<a<tp< 6B <b. Lete>0 
be sufficiently small so that f (t,x) is Lipschitz in x in K. with the Lipschitz constant L. 
If supx. || f — g|| 1s sufficiently small, then the IVP (2.33) has a solution y(t) defined on 
[a, 3], and the following estimate holds 
sup ||z (t) — y (é)|| < (8 - a) e"F sup || f — all. (2.45) 


[a,8] 


Proof. By Claim 2 of the above proof, the maximal solution y(t) of (2.33) is de- 
fined on [a’, 6’]. Also, the difference ||z (t) — y (t)|| satisfies (2.43) for all t € [a’, 6’). If 
sup. || f — g|| is small enough then by Claim 3 [a’, 6"] = [a, 6]. It follows that y (t) is 
defined on |a, 6] and satisfies (2.45). m 


2.8 Continuity of solutions with respect to a parameter 


Consider the IVP with a parameter s € R” 


patie! - 


where f :Q — R” and Q is an open subset of R"*™*'. Here the triple (t,x, s) is identified 
as a point in R"*’*+! as follows: 


(E058) = (b-Wises Tag Sy 4 Spa) 
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How do we understand (2.46)? For any s € R™, consider the open set 


= {(t, z) ER" : (t,2,8) € Qt 
Denote by S the set of those s, for which 9, contains (to, Zo), that is, 


S={s €R™: (to, %0) € 2} ={s € R™: (to, 20, 8) € QF 


R™! 


“(7 0% 0) 


Then the IVP (2.46) can be considered in the domain 2, for any s € S. We always 
assume that the set S is non-empty. Assume also in the sequel that f (t, 7, s) is a contin- 
uous function in (t,x, s) € Q and is locally Lipschitz in x for any s € S. For any s € S, 
denote by x (t, 5) the maximal solution of (2.46) and let J, be its domain (that is, /, is an 
open interval on the axis t). Hence, x(t, s) as a function of (t, s) is defined in the set 


U={(t,s)ER™' :s€S,tel,}. 


Theorem 2.11 Under the above assumptions, the set U is an open subset of R"** and 
the function x (t,s) : U — R” is continuous in (t, s). 


Proof. Fix some so € S and consider solution x(t) = x(t, so) defined for t € I,,. 
Choose some interval [a, 3] C J,, such that to € [a, 3]. We will prove that there is ¢ > 0 
such that 

[a, B] x B(so,e) CU, (2.47) 


which will imply that U is open. Here B(so,¢) is a closed ball in R™ with respect to 
oo-norm (we can assume that all the norms in various spaces R* are the oo-norms). 


dl 


sER” 


As in the proof of Theorem 2.10, consider a set 
Ke={ta) eR se<0< 7, ||e=a)||<e} 
and its extension in R"t™*! defined by 
K. = Kz x B(s0,€) = {(t,2,s) ER'*™ :a<t< B,||z —x(t)|| <e,||s— 50l| <e} 
(see the diagram below). 


sh 


B(s0,€) { F " 


R, =K, x B(so,€) 


If €¢ is small enough then K. is contained in Q (cf. the proof of Theorem 2.10 and 
Exercise 26). Hence, for any s € B(so,¢), the function f (t,x, s) is defined for all (t,x) € 
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K.. Since the function f is continuous on Q, it is uniformly continuous on the compact 
set K., whence it follows that 


sup ||f (t,2, 50) — f (t,2,s)|| ~ 0as s — 8p. 
(t,c)EKe 


Using Corollary to Theorem 2.10 with? f (t,x) = f (t,2, 0) and g (t,x) = f (t,,s) where 
s € B(s9,€), we obtain that if 


sup ||f (t,2,8) — f (t,2, s0)| 
(t,2)E Ke 


is small enough then then the solution y (t) = x (t,s) is defined on [a, 3]. In particular, 
this implies (2.47) for small enough ¢. Furthermore, by Corollary to Theorem 2.10 we 
also obtain that 


sup lle (t,8) —#(t,0)|| <C sup If (t,2,50) — f (t,2,)]), 
te[a,5] (t,2)EKe 


where the constant C' depending only on a,,¢ and the Lipschitz constant L of the 


function f (t,x, 59) in K.. Letting s — 59, we obtain that 


sup ||a (t,8) — x(t, 80)|| > 0 as s > 89, 


so that x (t,s) is continuous in s at so uniformly in t € [a, 3]. Since x (t, s) is continuous 
in t for any fixed s, we conclude that x is continuous in (t,s) (see Exercise 28), which 
finishes the proof. m 


2.9 Global existence 
Theorem 2.12 Let I be an open interval in R and let f (t,7) : I x R" — R” be a 
continuous function that is locally Lipschitz in x and satisfies the inequality 


If (2) <a lel +6), (2.48) 


for allt € I anda € R", where a(t) and b(t) are some continuous non-negative functions 
of t. Then, for all tp € I and x € R”, the initial value problem 


{ = Fe) (2.49) 


x (to) = Xo 
has a (unique) solution x (t) on I. 


In other words, under the specified conditions, the maximal solution of (2.49) is defined 
on I. 

Proof. Let x(t) be the maximal solution to the problem (2.49), and let J = (a, 2) 
be the open interval where x (t) is defined. We will show that J = J. Assume from the 
contrary that this is not the case. Then one of the points a, 9 is contained in J, say 3 € I. 


5Since the common domain of the functions f (t,x, s) and f (t,2, so) is (t,s) € Q,, Qs, Theorem 2.10 
should be applied with this domain. 
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Let us investigate the behavior of ||x (t) || as t ~ 6. By Theorem 2.8, (t,x (t)) leaves any 
compact K C 2) := JI x R”. Consider a compact set 
K = [6 —«, 8] x B(0,r) 


where € > 0 is so small that [3 — ¢,6] C I. Clearly, kK Cc 2. If t is close enough to 6 then 
t € [8 —e, 8]. Since (t,x (t)) must be outside K, we conclude that x ¢ B(0,r), that is, 
||x (t)|| > 7. Since r is arbitrary, we have proved that || (t)|| ~ co ast — (. 

On the other hand, let us show that the solution x (t) must remain bounded as t > (3. 
From the integral equation 


v(t)=a0+ ff (s2(s))ds 


we obtain, for any t € [to, 3), 


lel < lel + f IF (s.2 (lds 


0 


A 


< ||xol] + / (a (s) [lz (s)I| + 6(s)) ds 


IA 


t 
c+] 2 (s)|) ds, 
J to 


where F 
A=supa(s) and C= ||xo||+ i b(s) ds. 
[to,8] J to 


Since [to, 6] C I and functions a (s) and b(s) are continuous in [to, 3], the values of A and 
C are finite. The Gronwall lemma yields 


Iz (é)|| < C exp (A (t — to)) < Cexp (A (8 — to)). 


Since the right hand side here does not depend on t, we conclude that the function ||x (¢)|| 
remains bounded as t — 3, which finishes the proof. m 

Example. We have considered above the ODE 2’ = 2? defined in R x R and have 
seen that the solution x (t) = aaa cannot be defined on full R. The same occurs for the 
equation x’ = «° for a > 1. The reason is that the function f (t,7) = x* does not admit 
the estimate (2.48) for large x, due to a > 1. This example also shows that the condition 
(2.48) is rather sharp. 


A particularly important application of Theorem 2.12 is the case of the linear equation 
v= A(t)a+B(t), 


where x € R”, t € I (where J is an open interval in R), B: J — R", A: 1 = R™”. Here 
R"*" is the space of all n x n matrices (that can be identified with R"’). In other words, 
for each t € I, A(t) is ann x n matrix, and A(t) x is the product of the matrix A (t) and 
the column vector x. In the coordinate form, one has a system of linear equations 


a = So Ani (t) u+ By, (t) 5 
i=1 
for any k = 1,...,n. 
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Theorem 2.13 In the above notation, let A(t) and B(t) be continuous int € I. Then, 
for any to € I and x € R”, the IVP 


{ r' = A(t)x+ B(t) 


x (to) = Xo 
has a (unique) solution x (t) defined on I. 


Proof. It suffices to check that the function f (t,2) = A(t) «+ B (t) satisfies the con- 
ditions of Theorem 2.12. This function is obviously continuous in (t,2) and continuously 
differentiable in x, which implies by Lemma 2.6 that f (t,x) is locally Lipschitz in x. 

We are left to verify (2.48). By the triangle inequality, we have 


lf 2) I< |A@ al] + |B I. (2.50) 
Let all the norms be the oo-norm. Then 


b(t) = |B OI]. = max |B; ()| 


is a continuous function of t. Next, 


AO) atl = max|(A (#)),| = max 


S- Api (t) ZI 
l=1 


< €>» [Au om max || = 0 (4) [too 
where - 

a(t) = max 2 | Ain (t)| 
is a continuous function. Hence, we obtain from (2.50) 


IF) <@@ lle +6), 


which finishes the proof. m 


2.10 Differentiability of solutions in parameter 
Consider again the initial value problem with parameter 
GE tS) 
pes: 2.51 
{ n=, (2-51) 


where f : Q — R” is a continuous function defined on an open set 2 C R"*”™*t! and where 
(t,x, 5) = (t,01,...,%n, $1,--., Sm). Let us use the following notation for Jacobian matrices 
of f with respect to x and s. Set 


Of _ f Ofi 


where 7 = 1,...,n is the row index and k = 1,...,n is the column index, so that f, is an 


n Xn matrix. Similarly, set 
Of Ofi 
SoS an oe Os = ’ 
I Os J (54) 


59 


where 7 = 1,...,n is the row index and | = 1,...,m is the column index, so that f, is an 
n xX m matrix. 

If f, is continuous in 2 then by Lemma 2.6 f is locally Lipschitz in x so that all the 
previous results apply. 

Let x(t,s) be the maximal solution to (2.51). Recall that, by Theorem 2.11, the 
domain U of x (t, 5) is an open subset of R”*! and x : U — R” is continuous. 


Theorem 2.14 Assume that function f (t,x,s) is continuous and f, and f, exist and 
are also continuous in Q. Then x (t,s) is continuously differentiable in (t,s) € U and the 
Jacobian matrix y = 0,x solves the initial value problem 


y' = fx (t, 2 (t,8),s)y + fs (t, 2 (,8),8), 
2.52 
{ y (to) = 0. eee) 
Here 0,2 = (3) is ann xm matrix where k = 1,.., n is the row index and / = 1,...,m 


is the column index. Hence, y = 0,7 can be considered as a vector in R"*”™ depending on 
t and s. The both terms in the right hand side of (2.52) are also n x m matrices so that 
(2.52) makes sense. Indeed, f, is an n x m matrix, and f,y is the product of the n x n 
and n x m matrices, which is again an n x m matrix. 

The ODE in (2.52) is called the variational equation for (2.51) along the solution 
x (t,s) (or the equation in variations). 

Note that the variational equation is linear. Indeed, for any fixed s, its right hand side 
can be written in the form 

¥=Ay +B), 

where A(t) = f, (t,v(t,5),s) and B(t) = f,(t,x(t,s),s). Since f is continuous and 
x(t, s) is continuous by Theorem 2.11, the functions A(t) and B (t) are continuous in t. 
If the domain in ¢ of the solution x (t, s) is J, then the domain of the variational equation 
is J, x R"*™. By Theorem 2.13, the solution y (t) of (2.52) exists in the full interval J,. 
Hence, Theorem 2.14 can be stated as follows: if x (t,s) is the solution of (2.51) on J, 
and y(t) is the solution of (2.52) on J, then we have the identity y(t) = 0.x (t, s) for all 
t € I,. This provides a method of evaluating O,x (t, 5) for a fixed s without finding z (t, s) 
for all s. 


Example. Consider the IVP with parameter 


a 


in the domain (0, +00) x R x R (that is, t > 0 and x, s are arbitrary real). Let us evaluate 
x(t, s) and 0,x for s = 0. Obviously, the function f (t, 7, s) = 2?+2s/t is continuously dif- 
ferentiable in (x, s) whence it follows that the solution x (t, s) is continuously differentiable 
in (t, s). 

For s = 0 we have the IVP 


ag? 
x(1)=-1 
whence we obtain x(t,0) = —+. Noticing that f, = 2x and f, = 2/t we obtain the 


variational equation along this solution 
Ds 9 
v= (fe t.23)l-a 0) 9+ (66D 2 so) =— 59 +5. 
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This is a linear equation of the form y’ = a(t) y + b(t) which is solved by the formula 
y = eA f eAD0) dt, 
where A (t) is a primitive of a(t) = —2/t, that is A(t) = —2Int. Hence, 
ha eae [era =t7(?+C)=14+Ct”. 
The initial condition y (1) = 0 is satisfied for C = —1 so that y (t) = 1—t-*. By Theorem 


2.14, we conclude that 0,x (t,0) =1—t-. 
Expanding x (t,s) as a function of s by the Taylor formula of the order 1, we obtain 


x (t,s) =x (t,0) + 0,2 (t,0)s+0(s) ass 0, 


that is, 


1 1 
ge) = - (1 3) 8+0(s) as s — 0. 


In particular, we obtain for small s an approximation 


1 1 
x(t.) —7 + 1-3 Ss. 


Later we will be able to obtain more terms in the Taylor formula and, hence, to get a 
better approximation for 2 (t, s). 


Remark. It is easy to deduce the variational equation (2.52) provided we know that the 
function x (t,s) is sufficiently many times differentiable. Assume that the mixed partial 
derivatives 0,0;2 and 0,0,x exist and are the equal (for example, this is the case when 
x (t,s) € C?(U)). Then differentiating (2.51) in s and using the chain rule, we obtain 


0,0; = 0; (0:0) = 0; (7 (or (6 5)43)) =] behs).s) 0.20 + (62 Gs).3)5 


which implies (2.52) after substitution 0,7 = y. Although this argument is not a proof 
of Theorem 2.14, it allows to memorize the variational equation. The main technical 
difficulty in the proof of Theorem 2.14 is verifying the differentiability of x in s. 

How can one evaluate the higher derivatives of x (t, s) in s? Let us show how to find 
the ODE for the second derivative z = 0,,2 assuming for simplicity that n = m = 1, that 
is, both x and s are one-dimensional. For the derivative y = 0,” we have the IVP (2.52), 
which we write in the form 


y= 9 (ty, ) 
{ (i) =0 (2.53) 
where 
g (t, Y, s) = fee (t, v (t, s) ,8) 7] a fs (t, x (Z, s) ’ s) (2.54) 


For what follows we use the notation F (a,b,c, ...) € C* (a,b, ¢,...) if all the partial deriva- 
tives of the order up to & of the function F' with respect to the specified variables a, b,c... 
exist and are continuous functions, in the domain of F’. For example, the condition in 
Theorem 2.14 that f, and f, are continuous, can be shortly written as f € C1 (x, s), and 
the claim of Theorem 2.14 is that x (t,s) € C' (t,s). 
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Assume now that f € C?(z,s). Then by (2.54) we obtain that g is continuous and 
g € C'(y,s), whence by Theorem 2.14 y € C'(s). In particular, the function z = 0,y = 
0,;t is defined. Applying the variational equation to the problem (2.53), we obtain the 
equation for z 


eS = gy (ty (t, 8), 8) z+ 9s (t, y(t, 8), 8). 
Since 9, = f, (1,2, 8), 


9s (t, Y, 8) = faa (t, 0,8) (Ost) y + fas (t, 0,8) y + fea (6,2, 8) O50 + fos (6,2, 8) , 
and 0,7 = y, we conclude that 


{ 2! = fz (t, 2,8) 2+ fox (t, 2,8) y? + 2fes (t, 2,8) y + fos (t, 2, 8) 


Pb =0. (2.55) 


Note that here x must be substituted by x (t,s) and y — by y(t, s). 

The equation (2.55) is called the variational equation of the second order, or the second 
variational equation. It is a linear ODE and it has the same coefficient f, (t, x (t, 5) , s) 
in front of the unknown function as the first variational equation. Similarly one finds the 
variational equations of the higher orders. 


Example. This is a continuation of the previous example of the IVP with parameter 


a = 2? +2s/t 
x(1)=-1 


where we have computed that 


z(t) =2(4,0) =-5 ad WO =o -5. 


Let us now evaluate z = 0,,x (t,0). Since 
fo = 22, far =2, fas =0, fos = 0, 
we obtain the second variational equation 
Pas fleets) z+ eles) y 
= —f242 (=e 


Solving this equation similarly to the first variational equation with the same a(t) = — 
and with b(t) =2(1—t-®)’, we obtain 


2 
t 


: (t) eA) feo (t) dt = t-? i ot (1 = zoe dt 


9 2 2 Be cd. ef) 
Rep? SE he oe At = —t — 
: & f aoe 3° ff UE BP 


The initial condition z(1) = 0 yields C = 2 whence 


2 2 4 16 
ORO) ab a ag 
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Expanding x (t,s) at s = 0 by the Taylor formula of the second order, we obtain as 
s—0 


elt.s) = 2b) yo s+ s(t) s* +0 (s”) 


i _ 4-2 | 1 2 8 l 2, 2 
eat t )s+ (5 71 3p 2) § + o(s”). 


merical methods (MAPLE), the first order approximation u (t) = —}+(1—t~*) s (green) 
| 2 


and the second order approximation v (t) = —++(1— #7?) s+ ($¢ s, — 7) s* (red). 


-0.05 7 


-0.15 7 


-0.3 7 


-0.47 
-0.45 7 


-0.55 7 


-0.65 7 
x -0.77— 


Let us discuss an alternative method of obtaining the equations for the derivatives of 
x (t,s) ins. As above, let x(t), y(t), z(t) be respectively x (t,0), 0.x (t,0) and 0.x (t, 0) 
so that by the Taylor formula 


26) =20 49 G2 2 (t) 3? +0 (8). (2.56) 


Let us write a similar expansion for x’ = 0:7, assuming that the derivatives 0, and 0, 
commute on x. We have 
0,2 =002=y 


and in the same way 


Ot =O4y =D, —2 . 
Hence, 
1 
v'(t,s)=a' (t)+y (s+ 52 (t) 5? +0(s”). 
Substituting this into the equation 


a = 2? +2s/t 
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we obtain 

a’ (t)+y! (t)st+ 52 (t) s’? +0(s*) = (« (t)+y(t)s+ x (t)s’ +0 (°)) + 28/t 
whence 
x (t)+y'(t)s+ x (t) s? =a? (t) + 2 (t)y(t)s+ (y (t)? + 2 (t) z(t) s?+2s/t+o (s”) : 


Equating the terms with the same powers of s (which can be done by the uniqueness of 
the Taylor expansion), we obtain the equations 


at) == er) 
y(t) = 2x (t) y(t) + 2s/t 
Z(t) = 22x (t)z(t)4+ 2y’ (0). 


From the initial condition x (1, s) = —1 we obtain 


-1=2(1) +sy(1) + >2(1) +0(s?), 


whence x (t) = —1, y(1) = z (1) =0. Solving successively the above equations with these 
initial conditions, we obtain the same result as above. 


Before we prove Theorem 2.14, let us prove some auxiliary statements from Analysis. 


Definition. A set K C R” is called convex if for any two points x,y € K, also the full 
interval [x,y] is contained in K, that is, the point (1 — A) a+ Ay belong to K for any 
A € (0, 1]. 


Example. Let us show that any ball B(z,r) in R” with respect to any norm is convex. 
Indeed, it suffices to treat the case z = 0. If x,y € B(0,r) then ||z|| < r and |ly|| <r 
whence for any A € [0, 1] 


I|(1 — A) @ + Ayl] < (A) Ilel] + Allyl] <r. 
It follows that (1 — A) a+ Ay € B(0,r), which was to be proved. 


Lemma 2.15 (The Hadamard lemma) Let f (t,x) be a continuous mapping from Q to 
R! where Q is an open subset of R°*! such that, for any t € R, the set 


See R 26a) eo} 


is conver (see the diagram below). Assume that f, (t,x) exists and is also continuous in 
Q. Consider the domain 


Q/ 


{(t,2,y)€ RR" :teER, c,yeQ$ 
{(t,2,y) € R7"*" : (é,2) and (t,y) €Q}. 


Then there exists a continuous mapping 9 (t,2,y) : Q' — R'*" such that the following 
identity holds: 


ftw —-fta=~e(t2,y)y-2) 
for all (t,x,y) € (here y(t,xz,y)(y—) is the product of the | x n matrix and the 
column-vector). 
Furthermore, we have for all (t,x) € Q the identity 


OL Go) = Felt): (257) 
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é 
t 


N 
N. 


Remark. The variable t can be multi-dimensional, and the proof goes through without 
changes. 
Since f (t, 2) is continuously differentiable at x, we have 


fy) —-f (2) = fe (t,2) (y— x) + 0(|ly — 2|)) as yY 7 


The point of the above Lemma is that the term o(||z— y||) can be eliminated if one 
replaces f, (t,x) by a continuous function ¢ (t, 2, y). 


Example. Consider some simple examples of functions f (x) with n = 1 = 1 and without 
dependence on t. Say, if f (2) = 2? then we have 


fy) - f(x) =(+2)(y-2) 


so that y (x,y) =y+4. In particular, y (x, x) = 2x = f’ (x). A similar formula holds for 
f (cz) =2* with any k EN: 


fy) — fe) = (et tak %y +... $y) Y—2). 
For any continuously differentiable function f (x), one can define y (2, y) as follows: 
f=f@) 
= ee 
= y-z 
9 (x,y) { oa. wee. 


It is obviously continuous in (x,y) for « # y, and it is continuous at (x,x) because if 
(Xk, Yk) — (x, 2) as k — oo then 


f (ye) — f (te) 


Uk — Xr, 


=f (Ex) 
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where €; € (Xx, yx), which implies that €,, - x and hence, f" (&,) — f’ (x), where we have 
used the continuity of the derivative f’ (2). 

Clearly, this argument does not work in the case n > 1 since one cannot divide by 
y — x. In the general case, we use a different approach. 


Proof of Lemma 2.15. It suffices to prove this lemma for each component f; 
separately. Hence, we can assume that | = 1 so that y is a row (),...,Y,,). Hence, we 
need to prove the existence of n real valued continuous functions ¥,..., Y,, of (t,2, y) such 
that the following identity holds: 


f (ty) fb2) =) ei(b2,9) wi -2), 


Fix a point (t,x, y) € 2’ and consider a function 
FO)=f(t,2+A(y—2)) 


on the interval \ € [0,1]. Since z,y € Q, and Q, is convex, the point 2 + A(y— 2) 
belongs to (Q;. Therefore, (t,2 + A (y— x)) € © and the function F(A) is indeed defined 
for all A € [0,1]. Clearly, F (0) = f (t,x), F(1) = f(t,y). By the chain rule, F (A) is 
continuously differentiable and 


F'() = So fy (be + A(y— 2) (yi — 23). 


i=1 


By the fundamental theorem of calculus, we obtain 
1 

| F'(A)da 

Jo 


n 
i= 


ay te; (1,2 pAly= 2) (ys — v4) dA 


S~ 9; (t,2,y) (vi — 21) 
i=l 


where : 
PACE i fe, (t,2 + (y—2)) dd. (2.58) 


We are left to verify that y; is continuous. Observe first that the domain 1?’ of y; is an 
open subset of R?”*'. Indeed, if (t,2, y) € 0’ then (t,) and (t, y) € Q which implies by 
the openness of 2 that there is ¢ > 0 such that the balls B ((t,x),e) and B((t,y),¢) in 
R”*? are contained in Q. Assuming the norm in all spaces in question is the oo-norm, we 
obtain that B((t,2,y),¢) C Q'. The continuity of y; follows from the following general 
statement. 


Lemma 2.16 Let f (A,u) be a continuous real-valued function on |a,b| x U where U is 
an open subset of R*, \ € [a,G] andu€ U. Then the function 


eu)=f Fwd 


is continuous inue€e U. 
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Proof of Lemma 2.16. Let {u,}72, be a sequence in U that converges to some 
u € U. Then all uz with large enough index k are contained in a closed ball B (u,e) C U. 
Since f (A, u) is continuous in [a, b] x U, it is uniformly continuous on any compact set in 
this domain, in particular, in [a,b] x B (u,e). Hence, the convergence 


f A, ur) — f A,u) as k > 00 


is uniform in A € [0,1]. Since the operations of integration and the uniform convergence 
are interchangeable, we conclude that y (ux) — y (wu), which proves the continuity of y. 
The proof of Lemma 2.15 is finished as follows. Consider f,, (t,2+A(y—.2)) asa 
function of (A, t,x, y) € [0,1] x Q’. This function is continuous in (A, t,x, y), which implies 
by Lemma 2.16 that also y; (t,x, y) is continuous in (t, x, y). 
Finally, if « = y then f,, (t,7 + A(y — x)) = fs, (t,2) which implies by (2.58) that 


Yi (i, 2, x) = See; (t, x) 


and, hence, y (t,2,x) = fy (t,x), that is, (2.57). = 

Now we are in position to prove Theorem 2.14. 

Proof of Theorem 2.14. In the main part of the proof, we show that the partial 
derivative O,,x exists. Since this can be done separately for any component s;, in this part 
we can and will assume that s is one-dimensional (that is, m = 1). 

Fix some (t,, 5.) € U and prove that O,x exists at this point. Since the differentiability 
is a local property, we can restrict the domain of the variables (t,s) as follows. Choose 
(a, 3] to be any interval in J,,containing both tp) and t,. By Theorem 2.11, for any « > 0 
there is 6 > 0 such that the rectangle (a, 3) x (s, — 6,5, +6) is contained in U and, for 
all ¢ €'(5, — 0,8, +0), 


sup ||z (t,s) — x(t, s,)|| <e. 
tE(a,B) 


SA 
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Besides, by the openness of 2, ¢ and 6 can be chosen so small that the following 
condition is satisfied: 


Q := {(t, 2, s) EeR™*™? :a<t <Q, |lc-—z(t,s)||<e, |s—s|<d} CO 


(cf. the proof of Theorem 2.11). In particular, for all t € (a, 8) and s € (s. — 0,84 +4), 
the solution x (t,s) is defined and (t, x (t,s),s) € Q. 


In what follows, we restrict the domain of the variables (t,x, s) to Q. Note that this 
domain is convex with respect to the variable (x, s), for any fixed t. Indeed, for a fixed t, 
x varies in the ball B (a(t, s,) ,¢) and s varies in the interval (s, — 6,5, +6), which are 
both convex sets. 

Applying the Hadamard lemma to the function f (t, 7, s) in this domain and using the 
fact that f is continuously differentiable with respect to (x, s), we obtain the identity 


FQ) F E62) = 90,05) (Yo a) + (he,039,3) (s—7), 


where y and w are continuous functions on the appropriate domains. In particular, 
substituting 0 = s,, 7 = x(t, s,) and y = x(t, s), we obtain 


f 01,3) 33)— fF Oe hSes5e) = OG2U,S)sS5n2 ,5)s3) @ Hs) — 2 Gs) 
+1) (t,x (t, Sx) , Se, v(t, 8), 3) (8 — 5x) 
= a(t,s)(x(t,s) —ax(t,s,)) + b(t, s) (s — s,), 


where the functions 
a(t,s) = p(t, x(t, 54), 54,0 (t,5),s) and b(t,s)= w(t, x(t, 5.),54,0(t,5),s) (2.59) 


are continuous in (t,s) € (a,8) x (s, — 6,5, +6) (the dependence on s, is suppressed 
because s, is fixed). 
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Set for any s € (s, — 6,5, +6) \ {s.} 


(boy = Dts) 


Sie) = = 


and observe that 
oe = (ts) a Se) f (2S) 8) oP Gab Sa) 5 Se) 
a S— 8, i S— Sy 
= Gt, 8) 24-0, 8) 


Note also that z(to,s) = 0 because both x (t,s) and x(t,s,) satisfy the same initial 
condition. Hence, function z(t, s) solves for any fixed s € (s, — 6,8, +6) \ {s.} the IVP 


(2.60) 


Since this ODE is linear and the functions a and 6 are continuous in (t,s) € (a,() x 
(s, — 6,5 +6), we conclude by Theorem 2.13 that the solution to this IVP exists for all 
s € (s, — 6,5. +6) and t € (a,3) and, by Theorem 2.11, the solution is continuous in 
(t, s) € (a, B) x (sx — 6, $4 +6). Hence, we can define z(t, 5) also at s = s, as the solution 
of the IVP (2.60). In particular, using the continuity of z(t, 5) in s, we obtain 

lim 20 3) 2 (Se) 


that is, 
— at, 8. . 
0.0 (tse) = lim eA Wee lim 2 (t, 8) =z. (6,8.) 


88x 8 — Sy 88x 


Hence, the derivative y (t) = 0.x (t, 5.) exists and is equal to z (t, s,), that is, y (t) satisfies 


the IVP 
{ y = a(t, 8s) ¢ 8 (se) 

Note that by (2.59) and Lemma 2.15 

a(t, 8x) = P(E, (E, 84) , 84,0 (t, 84) Se) = fo (t, © (t, 84) , Sx) 
and 

OE Suh tet, Sey See (Ss) ey) See Be) 
Hence, we obtain that y (t) satisfies the variational equation (2.52). 

To finish the proof, we have to verify that x (t, s) is continuously differentiable in (t, s). 

Here we come back to the general case s € R”. The derivative 0,2 = y satisfies the IVP 
(2.52) and, hence, is continuous in (t,s) by Theorem 2.11. Finally, for the derivative 0,7 


we have the identity 
Oe =f (6 8(058):59), (2.61) 


which implies that 0,2 is also continuous in (t,s). Hence, x is continuously differentiable 
in (t,s). ™ 


Remark. It follows from (2.61) that O,x is differentiable in s and, by the chain rule, 
O; (Ox) = O [f (t,x (t, 8), 8)] = fr (t, v(t, 5), 8) O.a + f, (t, 2 (t, 8), 8). (2.62) 
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On the other hand, it follows from (2.52) that 
0; (0,0) = 0:9 = fe (t,2(h 8) 58) 0.2 +f, (t@ (8) 98) (2.63) 


whence we conclude that 

0,0; = 0,0,x. (2.64) 
Hence, the derivatives 0, and 0; commute® on x. As we have seen above, if one knew the 
identity (2.64) a priori then the derivation of the variational equation (2.52) would have 


been easy. However, in the present proof the identity (2.64) comes after the variational 
equation. 


Theorem 2.17 Under the conditions of Theorem 2.14, assume that, for some k € N, 
f (t, 2,8) € C* (x, s). Then the maximal solution x (t,s) belongs to C* (s). Moreover, for 


any multiinder a of the order |a| < k and of the dimension m (the same as that of s), 
we have 


O;0 t=O: Om. (2.65) 
Here a = (Q1,...,Qm) where a; are non-negative integers, |ja| = a; +... + Qn, and 


Ja| 
OF @ 


5 Ost...0s%m° 


3 Linear equations and systems 
A linear (system of) ODE of the first order is a (vector) ODE of the form 
xv’ = A(t)x+ Bit) 


where A(t) : J — R"*”", B: I = R", and J being an open interval in R. If A(t) and B (t) 
are continuous in ¢ then, for any tp € J and 2» € R”, the IVP 


ot a aa (3.1) 


Xx (to) = Xo 
has a unique solution defined on the full interval J (cf. Theorem 2.13). In the sequel, 


we always assume that A(t) and B(t) are continuous on J and consider only solutions 
defined on the entire interval J. 


3.1 Space of solutions of homogeneous systems 


The linear ODE is called homogeneous if B (t) = 0, and inhomogeneous otherwise. In this 
Section, we consider a homogeneous equation, that is, the equation x’ = A(t) x. Denote 
by A the set of all solutions of this ODE. 


°The equality of the mixed derivatives can be concluded by a theorem from Analysis II if one knows 


that both 0,0, and 0,0,x are continuous. Their continuity follows from the identities (2.62) and (2.63), 
which prove at the same time also their equality. 
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Theorem 3.1 A is a linear space and dim A = n.Consequently, if x1, ...,%n aren linearly 
independent solutions to x' = A(t) « then the general solution has the form 


where C},...,C,, are arbitrary constants. 


Proof. The set of all functions J — R” is a linear space with respect to the operations 
addition and multiplication by a constant. Zero element is the function which is constant 
0 on J. We need to prove that the set of solutions A is a linear subspace of the space 
of all functions. It suffices to show that A is closed under operations of addition and 
multiplication by constant. 

If « and y € A then also x + y € A because 


(ety) =a2'+y =Ar+Ar=A(r+y) 


and similarly Ax € A for any \ € R. Hence, A is a linear space. 

Fix to € J and consider the mapping ® : A — R” given by ®(x) = x (to). This 
mapping is obviously linear. It is surjective since for any v € R” there is a solution x (t) 
with the initial condition x (to) = v. Also, this mapping is injective because x (to) = 0 
implies x(t) = 0 by the uniqueness of the solution. Hence, ® is a linear isomorphism 
between A and R”, whence it follows that dim.A = dimR” = n. 

Consequently, if x1, ...,7, are linearly independent functions from A then they form a 
basis in A. It follows that any element of A is a linear combination of 7,...,2,, that is, 
any solution to 2’ = A(t) has the form (3.2). m 

Consider now a scalar linear homogeneous ODE of the order n, that is, the ODE 


a + a, (tha) +... tan (t) 2 =0, (3.3) 


where all functions a; (t) are defined on an open interval J C R and are continuous on J. 
As we know, such an ODE can be reduced to the vector ODE of the 1st order as follows. 
Consider the vector function 


xO @O ae Osa") (3.4) 


so that 
y n—2 n-1 
Xp, =@, X=W,.., Xn_1 = x! ) x, =a, 


Then (3.3) is equivalent to the system 


are = X92 
X, = Xs 
U —— 

Xn-1 = Xn 

/ 
X, = TAatXn — AQX%n-1 — «+» — AnX1 
that is, 
x SA (3.5) 


where 


0 1 0 0 

0 0 1 0 
A= ves 

0 0 0 1 

An An—1 An-2 «+. TAL 


Since A (t) is continuous in t on J, we can assume that any solution x (t) of (3.5) is defined 
on the entire interval J and, hence, the same is true for any solution z (¢) of (3.3). 
Denote now by A the set of all solutions of (3.3) defined on J. 


Corollary. A is a linear space and dim. A = n.Consequently, if 1, ...,% are n linearly 
independent solutions to 2) + a, (t) a") + .... + ap (t) x = 0 then the general solution 
has the form 

6 (OS Cat (Eye Ca) 


where C1,...,Cy, are arbitrary constants. 

Proof. The fact that A is a linear space is obvious (cf. the proof of Theorem 3.1). 
The relation (3.4) defines a linear mapping from Ato A. This mapping is obviously 
injective (if x(t) = 0 then x(t) = 0) and surjective, because any solution x of (3.3) 
gives back a solution « (t) of (3.5). Hence, A and A are linearly isomorphic, whence 
dimA=dimA=n. = 


3.2 Linear homogeneous ODEs with constant coefficients 
Consider the methods of finding n independent solutions to the ODE 
2™) +a) +. +a, =0, (3.6) 


where j,...,@n, are real constants. 

It will be convenient to obtain the complex valued general solution x (t) and then to 
extract the real valued general solution. The idea is very simple. Let us look for a solution 
in the form x (t) = e* where \ is a complex number to be determined. Substituting this 
function into (3.6) and noticing that 2) = *e™, we obtain the equation for \ (after 
cancellation by e**): 

agg AE i Ba 0, 


This equation is called the characteristic equation of (3.6) and the polynomial P (\) = 
A" + a,A""! + .... + dy is called the characteristic polynomial of (3.6). Hence, if is the 
root of the characteristic polynomial then the function e’ solves (3.6). We try to obtain 
in this way n independent solutions. 


Theorem 3.2 If the characteristic polynomial P (A) of (3.6) has n distinct complex roots 
Aq, «An, then the following n functions e™,...,e%"* are linearly independent solutions of 
(3.6). Consequently, the general complex solution of (3.6) is given by 


a(t)=Cie™ +..4+C,e", 
where C; are arbitrary complex numbers. : 
If X=a+if is a non-real root of P(A) then X = a —if is also a root, and the func- 


tions e 


te in the above sequence can be replaced by the real-valued functions e™ cos Bt, 
e sin Gt. 
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Proof. Let us prove this by induction in n that the functions e**, ...,e*”' are linearly 


independent provided j,...,A, are distinct complex numbers. If n = 1 then the claim is 
trivial, just because the exponential function is not identical zero. Inductive step from 
n—1ton: Assume that, for some complex constants C),...,C;, and all t € R, 


Ce"? +... + Che = 0, (3.7) 


and prove that C; = ... = C,, = 0. Dividing (3.7) by e*’ and setting yw; = Aj — An, we 
obtain 
Oe) 4a OC -jeee £0. = 0. 


Differentiating in t, we obtain 
Cypyett® +... + Chapin _ehr-!* = 0. 


By the inductive hypothesis, we conclude that Cj; = 0 when by 4; 4 0 we conclude 
C; = 0, for all j =1,...,n — 1. Substituting into (3.7), we obtain also C,, = 0. 

Since the complex conjugations commutes with addition and multiplication of num- 
bers, the identity P (A) = 0 implies P (X) = 0 (since a, are real, we have a, = a,). Next, 
we have _ 

e™ = e“ (cost +isin Gt) and e™ = e™ (cos Bt — sin Gt) (3.8) 


so that e* and e* are linear combinations of e“ cos Gt and e™ sin 3t. The converse is true 
also, because 


1 ee 1 x. 
e™ cos Bt = 5 ie - ) and e® sin 6t = a (ce — e*) (3.9) 
dt 


Hence, replacing in the sequence e™"*, ....,e%* the functions e and e by e® cos Bt and 
e* sin Gt preserves the linear independence of the sequence. ™ 


Example. Consider the ODE 
x" — 32' +227 =0. 
The characteristic polynomial is P(\) = \? — 3A + 2, which has the roots \; = 2 and 


Az = 1. Hence, the linearly independent solutions are e?* and e’, and the general solution 
is Cye* + Coe’. 


Example. Consider the ODE x” +2 = 0. The characteristic polynomial is P (\) = \?+1, 
which has the complex roots A; = 7 and Az = —i. Hence, we obtain the complex solutions 
e” and e~“. Out of them, we can get also real linearly independent solutions. Indeed, 
just replace these two functions by their two linear combinations (which corresponds to 
a change of the basis in the space of solutions) 

eit 4 git cit — eit 


5 =cost and =o = sint. 


Hence, we conclude that cost and sint are linearly independent solutions and the general 
solution is C; cost + Co sint. 

Example. Consider the ODE 2” — x = 0. The characteristic polynomial is P(A) = 
hoe P(N ST) (7 + +1) that has the roots A; = 1 and A23 = —} + i. Hence, we 
obtain the three linearly independent real solutions 


and the real general solution is 


3 3 
Gie’s: ent (« Cos ey + C3 sin Fe] : 


What to do when P(A) has fewer than n distinct roots? Recall the fundamental the- 
orem of algebra (which is normally proved in a course of Complex Analysis): any poly- 
nomial P (A) of degree n with complex coefficients has exactly n complex roots counted 
with multiplicity. What is it the multiplicity of a root? If Xo is a root of P(A) then its 
multiplicity is the maximal natural number m such that P (A) is divisible by (A — o)”", 
that is, the following identity holds 


P(A) = (A— Ao)" Q(A), 


where @ (A) is another polynomial of . Note that P(A) is always divisible by 4 — Ao so 
that m > 1. The fundamental theorem of algebra can be stated as follows: if A4,..., A. 
are all distinct roots of P(A) and the multiplicity of A; is m,; then 


My +... + Mp =n 


and, hence, 
P(A) = (A= Ay) 2. (A= Ap)”. 
In order to obtain n independent solutions to the ODE (3.6), each root A; should give 


rise to m, independent solutions. 


Theorem 3.3 Let 1,...,A, be all the distinct complex roots of the characteristic polyno- 
mial P(A) with the multiplicities my, ...,m,, respectively. Then the following n functions 
are linearly independent solutions of (3.6): 


te bs Ge Ioan KO, cn, (3.10) 
Consequently, the general solution of (3.6) is 


r mij—-1 


t= SoS Oyler, (3.11) 


j=1 k=0 


where Cy; are arbitrary complex constants. 

IfX=a+i8 is a non-real root of P of multiplicity m, then A= a— if is also a root 
of the same multiplicity m, and the functions tke, t*e in the sequence (3.10) can be 
replaced by the real-valued functions t*e® cos Bt, te sin Bt, for any k = 0,...,m—1. 


Remark. Setting 


we obtain from (3.11) 


a(t)= 5— P; (te. (3.12) 


Hence, any solution to (3.6) has the form (3.12) where P; is an arbitrary polynomial of t 
of the degree at most m, — 1. 


Example. Consider the ODE x” — 22’ + x = 0 which has the characteristic polynomial 
PO) Shh = OST), 


Obviously, A = 1 is the root of multiplicity 2. Hence, by Theorem 3.3, the functions e! 
and te’ are linearly independent solutions, and the general solution is 


x (t) = (Ci + Cot) et. 


Example. Consider the ODE x” + a/Y — 22” — 22" 4+ 2' +4 = 0. The characteristic 
polynomial is 


PSPS Ho eee Aad) Oly 


Hence, the roots are \; = 1 with m, = 2 and Ag = —1 with mg = 3. We conclude that 
the following 5 function are linearly independent solutions: 


entee te, Pex: 
The general solution is 


w(t) = (Cy + Cot) ef + (C3 + Cat + Cot?) eo. 


Example. Consider the ODE x’ + 2x” + 2’ = 0. Its characteristic polynomial is 
P(A) = +28 +A=dA (N27 4$-:1)7 =A(AFi72(A-3)’, 


and it has the roots Ay = 0, Ag = 7 and A3 = —i, where A» and A3 has multiplicity 2. The 
following 5 function are linearly independent solutions: 


Le te”,.e, te (3.13) 
The general complex solution is then 
Cy + (Co + Czt) e” + (Ca + Cot) e™. 


Replacing in the sequence (3.13) e”’,e~” by cost,sint and te”,te~“ by tcost,tsint, we 
obtain the linearly independent real solutions 


1, cost, tcost, sint, tsint, 
and the general real solution 


Cr+ (C2 + C3t) cost + (C4 + Ct) sint. 
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We make some preparation for the proof of Theorem 3.3. Given a polynomial P (A) = 
agA" +a"! +... + a9 with complex coefficients, associate with it the differential operator 


a d n d nl 
P (=) ao @ T ay (=) ae cee | ao 


where we use the convention that the “product” of differential operators is the composi- 


tion. That is, the operator P (4) acts on a smooth enough function f (t) by the rule 


I 
g 
a] 


d 
(=) f =aof™ taf) +... + aof 


(here the constant term do is understood as a multiplication operator). 
For example, the ODE 


a” tae U4... +a,¢ =0 (3.14) 


d 


where P (\) = \” + a,\""! +... + dy is the characteristic polynomial of (3.14). 


can be written shortly in the form 


Example. Let us prove the following identity: 
d\n dt 
P mye Pi rye™. (3.15) 


It suffices to verify it for P(A) = \* and then use the linearity of this identity. For such 
P(X) =, we have 


ae eo Mt 
P Te eT Se =P yer 


which was to be proved. 


Lemma 3.4 Jf f(t), g(t) are n times differentiable functions on an open interval then, 
for any polynomial P of the order at most n, the following identity holds: 


d Sloe ad 
ss a sepa) | 
p(t) ia a? (=) > (3.16) 
Example. Let P(\) => +41. Then P’ (A) = 24 +1, P” = 2, and (3.16) becomes 


" / d / / d Wp" d 
(fg) +(f9) + fg = fP (q)orrr (q)or sf Pa (=) 
= f(g +g +9) +f (29 +9) + f'"9. 
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It is an easy exercise to see directly that this identity is correct. 


Proof. It suffices to prove the identity (3.16) in the case when P(\) = *, k < n, 
because then for a general polynomial (3.16) will follow by taking linear combination of 
those for \*. If P(A) = * then, for j < k 


P® =k(k—-1)...(k-j+ D9 


and PY) = 0 for j > k. Hence, 


w (2 ay" 
PY | — = k(k—-1)...(k-—79+1)(— i<k 
(G) = R&D b-F4+(G) 5<h 
fad 
DOI ee ee 

(=) 0, J>R, 


and (3.16) becomes 


k : k 
=r k(k—1) _ TITY oi) gla) = S- (‘) fag), (3.17) 
- | 


J j=0 


The latter identity is known from Analysis and is called the Leibniz formula’. 


Lemma 3.5 A complex number X is a root of a polynomial P with the multiplicity m if 
and only if 
P(X) =0 for all k =0,...,m—1 and P™ (A) £0. (3.18) 


Proof. If P has a root A with multiplicity m then we have the identity for all z € C 
P(z) = (z-A)" Q(z) 


where Q is a polynomial such that Q (A) 4 0. For any natural k, we have by the Leibniz 
formula 


k 
P= >> (8) GP GG). 
If k <_m then also 7 < m and 
((z —d)") = const (z — d)", 


which vanishes at z = \. Hence, for k < m, we have P“) (\) = 0. For k = m we have 
again that all the derivatives ((z — \)")" ) vanish at z = \ provided j < k, while for j = k 
we obtain 


((2- Ay") = (2 - AN”) =m! £0. 


Hence, 


PIM A) = ((2- A)" QO) £0. 
"If k = 1 then (3.17) amounts to the familiar product rule 
(fo) = fig t fa. 


For arbitrary k € N, (3.17) is proved by induction in k. 
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Conversely, if (3.18) holds then by the Taylor formula for a polynomial at ’, we have 


U (n) 
P(z) = P(A)+ ry (z-A)+..+ a ae (z—)" 
(m) (n) 
2 P a) (ayn P ae (2d) 
= (z—- A)" Q(z) 
where 
(m) (m+1) (n) 
Q(2Z)= e ~ ee nt GA) Past a2 (z-A)"™. 


Obviously, Q (A) = ee # 0, which implies that \ is a root of multiplicity m. 


Lemma 3.6 /f \;,...,A, are distinct complex numbers and if, for some polynomials P; (t), 
S "Pj (the =0 for allt ER, (3.19) 


then P; (t) = 0 for all j. 


Proof. Induction in r. If r = 1 then there is nothing to prove. Let us prove the 
inductive step from r—1 to r. Dividing (3.19) by e*’ and setting 4; = A; — A,, we obtain 
the identity 


SA t) es + P. (t) =0. (3.20) 


Choose some integer k > deg P. ieee deg P as the maximal power of t that enters P 
with non-zero coefficient. Differentiating the above identity k times, we obtain 


where we have used the fact that (P. \” = 0 and 
k 
(P; (4) et") = Q; (tel 


for some polynomial Q; (this for example follows from the Leibniz formula). By the 
inductive hypothesis, we conclude that all Q; = 0, which implies that 
(Pres) =0. 


Hence, the function P;e“i* must be equal to a polynomial of the degree at most k, which 
is only possible if P; = 0. Substituting into (3.20), we obtain that also P.=0. m 

Proof of Theorem 3.3. Let P(A) be the characteristic polynomial of (3.14). We 
first prove that if A is a root of multiplicity m then the function te solves (3.14) for any 
k =0,...,m—1. By Lemma 3.4, we have 


d kt) “1 k\) pi) d At 
p (=) (te man ie aye 


= a (8)? P® (r) 


=o J 
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If j > k then the (th) = 0. If j < k then j < m and, hence, P” (\) = 0 by hypothesis. 
Hence, all the terms in the above sum vanish, whence 


d 
P k At = 
(=) (t € ) 0, 
that is, the function x (t) = te’ solves (3.14). 


If A1, ..., Ar are all distinct complex roots of P (A) and m, is the multiplicity of A; then 
it follows that each function in the following sequence 


te eects oe Up nat als (3.21) 


is a solution of (3.14). Let us show that these functions are linearly independent. Clearly, 
each linear combination of functions (3.21) has the form 


r mj-1 r 
SS Gite SS FR Oe” (3.22) 
j=l k=0 j=l 
where P; (t) = 77% : Cjxt* are polynomials. If the linear combination is identical zero 


then by Lemma 3.6 P; = 0, which implies that all C;;, are 0. Hence, the functions (3.21) 
are linearly independent, and by Theorem 3.1 the general solution of (3.14) has the form 
(8:22), 

Let us show that if X = a+ is a complex (non-real) root of multiplicity m then 
\ = a—if is also a root of the same multiplicity m. Indeed, by Lemma 3.5, X satisfies the 
relations (3.18). Applying the complex conjugation and using the fact that the coefficients 
of P are real, we obtain that the same relations hold for instead of \, which implies 
that \ is also a root of multiplicity m. Z 

The last claim that every couple t*e™’, t*e*’ in (3.21) can be replaced by real-valued 
functions t*e™ cos Bt, t*e® sin Bt, follows from the observation that the functions t*e™ cos Bt, 
t*e* sin Bt are linear combinations of t*e*, t*e*’, and vice versa, which one sees from the 
identities 

e™ cos Bt = ; G + e) , ev sin Bt = - (e% — e*) : 


et — e% (cos Bt + isin At), ev = ew (cos Gt — isin Bt) , 
multiplied by t* (compare the proof of Theorem 3.2). 


3.3. Space of solutions of inhomogeneous systems 
Consider now an inhomogeneous linear ODE 
x! = A(t)x + B(t), (3.23) 


where A(t): J — R"*" and B(t) : J — R” are continuous mappings on an open interval 
ICR. 


Theorem 3.7 If xo (t) is a particular solution of (3.23) and x, (t),..., Un (t) is a sequence 
of n linearly independent solutions of the homogeneous ODE x’ = Ax then the general 
solution of (3.23) is given by 


x (t) = xo (t) + Crzi (t) +... HCntn (t). (3.24) 
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Proof. If x (t) is also a solution of (3.23) then the function y (t) = x (t) — 29 (t) solves 
y’ = Ay, whence by Theorem 3.1 


and x (t) satisfies (3.24). Conversely, for all C),...C,, the function (3.25) solves y/ = Ay, 
whence it follows that the function x (t) = 2 (t) + y (t) solves (3.23). m 
Consider now a scalar ODE 


a™ + a, (t)a) +... +an (t)a = f (t) (3.26) 


where all functions a1, ...,a,, f are continuous on an interval J. 


Corollary Jf xo (t) is a particular solution of (3.26) and x, (t),...,%n (t) is a sequence of 
n linearly independent solutions of the homogeneous ODE 


a + a, (t)a-) +... tan (t)2 = 0, 
then the general solution of (3.26) is given by 


ot): = fot) AR Ox (b) a oe Ota l(b): 


The proof is trivial and is omitted. 


3.4 Linear inhomogeneous ODEs with constant coefficients 
Here we consider the ODE 
a” +a) +... +a, = f (4), (3.27) 


where the function f (t) is a quasi-polynomial, that is, f has the form 
FH) = SOR, He" 
J 


where R; (¢) are polynomials, j; are complex numbers, and the sum is finite. It is obvious 
that the sum and the product of two quasi-polynomials is again a quasi-polynomial. 
In particular, the following functions are quasi-polynomials 


tke cosBt and t*e™ sin Bt 
(where k is a non-negative integer and a, 3 € R) because 


iBt 1 ppt iBt _ -ift 
cos bt = eaves and sin 6t = oes 
2 20 
As we know, the general solution of the inhomogeneous equation (3.27) is obtained as 
a sum of the general solution of the homogeneous equation and a particular solution of 
(3.27). Hence, we focus on finding a particular solution of (3.27). 
As before, denote by P (A) the characteristic polynomial of (3.27), that is 


P(A) =A +arAX 14+... 4+ ap. 
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Then the equation (3.27) can be written shortly in the form P (4) x = f, which will be 
used below. We start with the following observation. 
Claim. If f =cfit+...tcnf, and x (t),..., 2% (t) are solutions to the equation P (4) Lt; = 
fj, then x = C141 +... + CeLp solves the equation P (4) fee 

Proof. This is trivial because 


d d d 
J J J 
. 
Hence, we can assume that the function f in (3.27) is of the form f(t) = R(t) e! 
where F(t) is a polynomial. 
To illustrate the method, which will be used in this Section, consider first the following 
example. 


Example. Consider the ODE 
P (=) oe" (3.28) 


where j is not a root of the characteristic polynomial P(A) (non-resonant case). We 
claim that (3.28) has a particular solution in the form x (t) = ae’ where a is a complex 
constant to be chosen. Indeed, we have by (3.15) 


(i) te = Pines 


whence F 
P (=) (ae“*) = elt 
provided 
1 


Consider some concrete examples of ODE. Let us find a particular solution to the 
ODE 
ge 4g gS 


Note that P(A) = 2? +2 +1 and py = 1 is not a root of P. Look for a solution in the 
form x (t) = ae’. Substituting into the equation, we obtain 


ae’ + 2ae' + ae’ = e' 
whence we obtain the equation for a: 
4a=1l1,a=- 


Alternatively, we can obtain a from (3.29), that is, 


ae Ce | 
P(w) 14241 4 


a= 
Hence, the answer is x (t) = Je’. 


4 
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Consider another equation: 
x" +2¢'+a=sint (3.30) 
Note that sint is the imaginary part of e”. So, we first solve 
a” +22’ +a = et 


and then take the imaginary part of the solution. Looking for a solution in the form 
x (t) = ae’, we obtain 


1 1 1 a 
— =— = = _ 
Pi) 22 sP 2e dl: ~ 24 2 
Hence, the solution is 
= —~et = ee ee aa deat 
a ee a — 9 2 
Therefore, its imaginary part « (t) = —4 cost solves the equation (3.30). 


Consider yet another ODE 
x” +22 +a” =e' cost. (3.31) 
Here e~‘ cost is a real part of e“’ where pp = —1 +i. Hence, first solve 
ge 4 +a = el. 


Setting x (t) = ae”, we obtain 


Hence, the complex solution is x (t) = —e~9! = —e~' cost — ie~ sint, and the solution 
to (3.31) is x (t) = —e~* cost. 
Finally, let us combine the above examples into one: 


x" + 2a + 2 = 2e' — sint +e‘ cost. (3.32) 


A particular solution is obtained by combining the above particular solutions: 


1 1 
2 (=<) — (-3 cost) + (-e* cos t) 


en ae ee 
= 7e = cost —e Cost. 
a 


x (t) 


Since the general solution to the homogeneous ODE x” + 22’ + x = 0 is 
x (t) = (C, ole Cot) e*, 


we obtain the general solution to (3.32) 


1 1 
x(t) =(Ci+Cot)e* + xe + 5 cost — e ‘cost. 
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Consider one more equation 
gS gk Aap es", 


This time ps = —1 is a root of P(A) = \7 +2+ 1 and the above method does not work. 
Indeed, if we look for a solution in the form x = ae~ then after substitution we get 0 in 
the left hand side because e~‘ solves the homogeneous equation. 

The case when jz is a root of P(A) is referred to as a resonance. This case as well as 
the case of the general quasi-polynomial in the right hand side is treated in the following 
theorem. 


Theorem 3.8 Let R(t) be a non-zero polynomial of degree k > 0 and yw be a complex 


number. Let m be the multiplicity of pw if is a root of P andm = 0 if ts is not a root of 
P. Then the equation 


has a solution of the form 

x(t) =t™Q (i) eM, 
where Q (t) is a polynomial of degree k (which is to be found). 
Example. Come back to the equation 

ge +2e' +2=e7%. 


Here ys = —1 is a root of multiplicity m = 2 and R(t) = 1 is a polynomial of degree 0. 
Hence, the solution should be sought in the form 


nt) Sare” 


where a is a constant that replaces Q (indeed, @ must have degree 0 and, hence, is a 
constant). Substituting this into the equation, we obtain 


a ((@en)" 2 (t?e~")’ + Pe) =e % (3.33) 
Expanding the expression in the brackets, we obtain the identity 
er)" +2 (er) + Pet = Qe*, 
so that (3.33) becomes 2a = 1 and a = 4. Hence, a particular solution is 
1 
a) = xf e 
Consider one more example. 


ov" +2c' +a = tet 
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with the same » = —1 and R(t) = t. Since deg R = 1, the polynomial Q must have 
degree 1, that is, Q(t) = at +b. The coefficients a and 6 can be determined as follows. 
Substituting 

a(t) = (at +b) te = (at? + bt?) e* 


into the equation, we obtain 
v" +c’ +a = ((at? + bt?) eo + 2 ((at? + bt?) eo + (at? + bt?) e* 
(2b + Gat) e~. 
Hence, comparing with the equation, we obtain 
2b+ 6at =t 
so that b= 0 anda= z. The final answer is 


a 
x (t) = rie 


Proof of Theorem 3.8. Let us prove that the equation 
d 
P (=) z = R(t)e™ 


no) One 
where m is the multiplicity of 4 and deg Q = k = deg R. Using Lemma 3.4, we have 
d d 1 fd 
—)rc = P(—) (tQ(He") =S°=(t” @) p@) ( — | ett 
p(Z)e = P(Z) mower Ejay Po (F< 


SQ (A) POH (uy) et (3.34) 


‘a 
joo 7 


has a solution in the form 


By Lemma 3.4, the summation here runs from 7 = 0 to 7 = n but we can allow any 
j > 0 because for j > n the derivative PY is identical zero anyway. Furthermore, since 
PD (2) = 0 for all 7 < m—1, we can restrict the summation to j > m. Set 


y (t) = (Q(t) (3.35) 


and observe that y(t) is a polynomial of degree k, provided so is Q(t). Conversely, for 
any polynomial y (t) of degree k, there is a polynomial Q (t) of degree k such that (3.35) 
holds. Indeed, integrating (3.35) m times without adding constants and then dividing by 
t™, we obtain Q(t) as a polynomial of degree k. 

It follows from (3.34) that y must satisfy the ODE 


Pm) Pmt) Piri) 


m! (m+ 1)! (m+ 1)! 
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which we rewrite in the form 


boy + bry! +... + by +... = R(t) (3.36) 


es } (in fact, the index 7 in the left hand side of (3.36) can be restricted 


toi<ksince y =0 for i> k). Note that 


where b; = 


by = ees) #0. (3.37) 


m! 


Hence, the problem amounts to the following: given a polynomial 
R(t) =rot® + ryt? 1 +...4+75 


of degree k, prove that there exists a polynomial y (t) of degree k that satisfies (3.36). Let 
us prove the existence of y by induction in k. 

The inductive basis. If k = 0, then R(t) = rp and y(t) = a, so that (3.36) becomes 
aby = 19 whence a = ro/bo (where we use that bo 4 0). 

The inductive step from the values smaller than k to k. Represent y in the from 


y = at" + z(t), (3.38) 


where z is a polynomial of degree < k. Substituting (3.38) into (3.36), we obtain the 
equation for z 


boz 4 by 2! ea bz bi. = R(t) = (abot + aby (t*)' + + aby (")) =: Rit). 


Choosing a from the equation aby = ro we obtain that the term t* in the right hand side 
of (3.38) cancels out, whence it follows that R(t) is a polynomial of degree < k. By the 
inductive hypothesis, the equation 


boz thie +... +029 +... = R(t) 


has a solution z(t) which is a polynomial of degree < k. Hence, the function y = at* + z 
solves (3.36) and is a polynomial of degree k. m 


Remark. If k = 0, that is, R(t) = ro is a constant then (3.36) yields 


To m'ro 
2 by POM (yp) 
The equation (3.35) becomes 
= (my _mlro 
t t = 
("Q()) = Beare 


whence after m integrations we find 


To 


2) = Bom Gay’ 
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Therefore, the ODE P (4) x = roe has a particular solution 
To t 
oS Ft eC. 3.39 


Example. Consider again the ODE x” + 22’ + « = e'. Then ps = —1 has multiplicity 
m = 2, and R(t) =1. Hence, by the above Remark, we find a particular solution 


1 
eh) = P"(=1) 


Pet= dae, 
2, 


3.5 Second order ODE with periodic right hand side 


Consider a second order ODE 
a" + pa! + qx = f(t), (3.40) 


which occurs in various physical phenomena. For example, (3.40) describes the movement 
of a point body of mass m along the axis x, where the term px’ comes from the friction 
forces, gx - from the elastic forces, and f (t) is an external time-dependant force. Another 
physical situation that is described by (3.40), is an electrical circuit: 


Vit) *, iL 


As before, let R the resistance, L be the inductance, and C' be the capacitance of the 
circuit. Let V (t) be the voltage of the power source in the circuit and x (t) be the current 
in the circuit at time t. Then we have seen that the equation for 2 (t) is 


x 
La" + Rr’ +. =V'. 
LZ + RE + ‘ai 
If L > 0 then dividing by L we obtain an ODE of the form (3.40). 
As an example of application of the above methods of solving such ODEs, we investi- 


gate here the case when function f (t) is periodic. More precisely, consider the ODE 


x" + pa’ + qx = Asinut, (3.41) 
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where A,w are given positive reals. The function Asin wt is a model for a more general 
periodic force, which makes good physical sense in all the above examples. For example, 
in the case of electrical circuit the external force has the form A sin wt if the power source 
is an electrical socket with the alternating current (AC). The number w is called the 
frequency of the external force (note that the period = a) or the external frequency, and 
the number A is called the amplitude (the maximum value) of the external force. 
Assume in the sequel that p > 0 and q > 0, which is physically most interesting case. 
To find a particular solution of (3.41), let us consider the ODE with complex right hand 
side: 
a” + pa’ + qx = Ae™. (3.42) 


Consider first the non-resonant case when iw is not a root of the characteristic polynomial 
P(A) = »° + p\+q. Searching the solution in the from ce’ , we obtain 


oa So: eee 
“~P(w) —wtpwtq : 


and the particular solution of (3.42) is 
(a + ib) e = (acoswt — bsinwt) +4 (asinwt + bcoswt). 


Taking its imaginary part, we obtain a particular solution to (3.41) 


x(t) =asinwt + bcoswt = Bsin (wt + y) (3.43) 
where F 
B=V27+ = |cl = ———. (3.44) 


(q — w?)? + wp? 
and y € [0, 27) is determined from the identities 
ea ee 
cosp= 5, sing= zp. 
The number B is the amplitude of the solution and ¢ is the phase. 
To obtain the general solution to (3.41), we need to add to (3.43) the general solution 


to the homogeneous equation 
xv" +pe' + qr =0. 


Let A; and Az are the roots of P (A), that is, 


Pp p 
Mo = art yee ‘ 
1,2 5 q 


Consider the following possibilities for the roots. 
A, and 2 are real. Since p > 0 and q > 0, we see that both A; and 2 are strictly 
negative. The general solution of the homogeneous equation has the from 


Ce 4 Ge if At x da, 
(Getepe™ aby = 2s: 
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In the both cases, it decays exponentially in t as t — +oo. Hence, the general solution of 
(3.41) has the form 


x(t) = Bsin (wt + vy) + exponentially decaying terms. 


As we see, when t — oo the leading term of x(t) is the above particular solution 
Bsin(wt+y). For the electrical circuit this means that the current quickly stabilizes 
and becomes also periodic with the same frequency w as the external force. 


A, and A» are complex. 
Let A1,2 = a +78 where 


2 
a=—p/2<0 and B=\/q-2>0. 


The general solution to the homogeneous equation is 


e™ (C, cos Bt + C2 sin Gt) = Ce“ sin (Bt +7). 


The number ( is called the natural frequency of the physical system in question (pendu- 
lum, electrical circuit, spring) for the obvious reason - in absence of the external force, 
the system oscillate with the natural frequency (. 


Hence, the general solution to (3.41) is 
z(t) = Bsin(wt + y) +Ce™ sin (St +). 


If a < 0 then the leading term is again Bsin(wt+y). Here is a particular example of 
such a function: sint + 2e~“/4 sin rt 


A, and are purely imaginary, that is, a = 0. In this case, p = 0, q = 6”, and the 
equation has the form 
x" + 62 = Asinuwt. 


The assumption that iw is not a root implies w # G. The general solution is 
z(t) = Bsin(wt+ yp) +Csin(6t+ y), 
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which is the sum of two sin waves with different frequencies - the natural frequency and 
the external frequency. Here is a particular example of such a function: sin t + 2 sin zt: 


y 


-2.5 7 


Strictly speaking, in practice such electrical circuits do not occur since the resistance is 
always positive. 

Let us come back to the formula (3.44) for the amplitude B and, as an example of 
its application, consider the following question: for what value of the external frequency 
w the amplitude B is maximal? Assuming that A does not depend on w and using the 
identity 

a. A? 
wt + (p? — 2g) w? + 9?’ 
we see that the maximum B occurs when the denominators takes the minimum value. If 
p* > 2q then the minimum value occurs at w = 0, which is not very interesting physically. 
Assume that p? < 2q (in particular, this implies that p? < 4q, and, hence, \; and 22 are 
complex). Then the maximum of B occurs when 


The value 
wo = fq —p?/2 
is called the resonant frequency of the physical system in question. If the external force 
has the resonant frequency then the system exhibits the highest response to this force. 
This phenomenon is called a resonance. 
Note for comparison that the natural frequency is equal to 8 = \/q — p?/4, which is 
in general different from wo. In terms of wo and 3, we can write 


‘ AC A 
A = £8,322 27 7 9 2 ; 1 
a wow +q° (w? — ws) +g? — WG 
A2 


(uh) +P 


where we have used that 
2\ 2 4 
Pp Pp 
g —wy= 9 - («- r) =qp = =P 0. 
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In particular, the maximum amplitude that occurs when w = wo is Bmax = + 
In conclusion, consider the case, when iw is a root of P (A), that is 
(iw)? + piw+q=0, 


which implies p = 0 and g = w?. In this case a = 0 and w = wo = 8 = /¢@, and the 
equation has the form 
"+ we = Asinut. 


Considering the ODE . 
gl! 4 wer = Ag 


and searching a particular solution in the form zx (t) = cte“’, we obtain by (3.39) 


A A 


c= Se FS 


Pi (iw) 2iw’ 
Hence, the complex particular solution is 


Al, a At gee in 
= ——e = —i— co — sin 
Pie eo age 


x (t) 


and its imaginary part is 


At 
g(t) = oe cos wt. 
WW 


Hence, the general solution is 
At . 
r= = coswt + C'sin (wt +). 


Here is an example of such a function: —tcost + 2sint 


¥ 


Hence, we have a complete resonance: the external frequency w is simultaneously equal 
to the natural frequency and the resonant frequency. In the case of a complete resonance, 
the amplitude increases in time unbounded. Since unbounded oscillations are physically 
impossible, either the system breaks down over time or the mathematical model becomes 
unsuitable for describing the physical system. 
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3.6 The method of variation of parameters 
3.6.1 A system of the 1st order 


We present here the method of variation of parameters in order to solve a general linear 
system 
x = A(t)a+ B(t) 


where as before A(t) : J — R"*” and B(t) : J — R” are continuous. Let 2; (t),..., 2p (t) 
be n linearly independent solutions of the homogeneous system 2’ = A(t) x, defined on 
I. We start with the following observation. 


Lemma 3.9 Jf the solutions x (t),...,%n (t) of the system x’ = A(t) x are linearly inde- 
pendent then, for any to € I, the vectors x1 (to) ,...;n (to) are linearly independent. 


Proof. Indeed, assume that for some constant C},....,Ch 


Consider the function x (t) = Cia (t) +... + Cha, (t). Then x (t) solves the IVP 


whence by the uniqueness theorem x (t) = 0. Since the solutions 11, ..., 7, are independent, 
it follows that C) = ... = C;, = 0, whence the independence of vectors 2 (to) , ..., Ln (to) 
follows. 


Example. Consider two vector functions 


cost sint 
a) ea pada y= ee 


which are obviously linearly independent. However, for t = 7/4, we have 


so that the vectors x; (7/4) and x (7/4) are linearly dependent. Hence, x (t) and 22 (t) 
cannot be solutions of the same system 2’ = Ax. 
For comparison, the functions 


cos t —sint 
as a4 pea ( cos t ) 


are solutions of the same system 
oS ve ¢ 
SANE, alle hoon 


and, hence, the vectors x (t) and x(t) are linearly independent for any t. This follows 
also from 


det (x1 | x2) = det ( cn oe ) = TAO: 


sin cost 
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Given n linearly independent solutions to x’ = A(t) x, form an x n matrix 
X (t) = (a1 (f) | #2 (f) |---| en () 


where the k-th column is the column-vector 2; (t), k = 1,...,n. The matrix X is called 
the fundamental matrix of the system x’ = Ax. 

It follows from Lemma 3.9 that the column of X (t) are linearly independent for any 
t € I, which in particular means that the inverse matrix X~' (t) is also defined for all 
t € I. This allows us to solve the inhomogeneous system as follows. 


Theorem 3.10 The general solution to the system 
ze =A(t)x+B(t), (3.45) 


is given by 
z(t) =X (t) / X71 (t) B(t) dt, (3.46) 


where X is the fundamental matrix of the system a’ = Ax. 


Note that X~'B is a time dependent n-dimensional vector, which can be integrated 
in ft componentwise. 
Proof. Observe first that the matrix X satisfies the following ODE 


X'= AX. 


Indeed, this identity holds for any column x; of X, whence it follows for the whole matrix. 
Differentiating (3.46) in t and using the product rule, we obtain 


a = XH) [XO HBOHa+XH (X71 HBO) 
= AX [XB (a+ Bi) 
= Av+B(t). 


Hence, x (t) solves (3.45). Let us show that (3.46) gives all the solutions. Note that the 
integral in (3.46) is indefinite so that it can be presented in the form 


[xX @BOd=VH+e, 


where V (t) is a vector function and C = (Cj,...,C;,) is an arbitrary constant vector. 
Hence, (3.46) gives 
x(t) X(tHVE\)H+EX (EHC 


where 2p (t) = X (t) V (t) is a solution of (3.45). By Theorem 3.7 we conclude that x (t) 
is indeed the general solution. m 
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Second proof. Let us show a different way of derivation of (3.46) that is convenient 
in practical applications and also explains the term “variation of parameters”. Let us 
look for a solution to (3.45) in the form 


x (t) = Cy (t) a (t) +... + Ch (t) an (8) (3.47) 


where C,Co,..,C;, are now unknown real-valued functions to be determined. Since 
x1 (t),..., Z(t) are for any t linearly independent vectors, any R"-valued function x (t) 
can be represented in the form (3.47). The identity (3.47) can be considered as a lin- 
ear system of algebraic equations with respect to the unknowns C},...,C,. Solving it by 
Cramer’s rule, we obtain C),...,C;, in terms of rational functions of 71, ..., 2,2”. Since the 
latter functions are all differentiable in t, we obtain that also C/,...,C, are differentiable 
in t. 
Differentiating the identity (3.47) in time and using 2, = Ax,, we obtain 


gg = Cit + Cox lee a Cie 


Cyt, + Chara +... +O tn 

= C,Ar, + CyArg +...+C, Ar, 
Cia t+ Cota t+... + Chan 

= ArtCar, + Chrg+...+ Chan. 


Hence, the equation x’ = Ar + B becomes 
Cia1 + Chag +... + Chan = B. (3.48) 


If C(t) denotes the column-vector with components C} (t),...,C, (t) then (3.48) can be 
written in the form 


XC =B 
whence 
C' = XB, 
Cys fae (t) B (t) dt, 
and 
Go — ACH) fx (t) B (t) dt. 
rT] 
The term “variation of parameters” comes from the identity (3.47). Indeed, if Cy, ...., Cn 


are constant parameters then this identity determines the general solution of the homoge- 
neous ODE wv’ = Az. By allowing C},...,C;, to be variable, we obtain the general solution 
to a’ = Ar+ B. 


Example. Consider the system 


or, in the vector form, 


It is easy to see that this system has two independent solutions 


cost —sint 
vi) ( sint ) anda Ml ( cost ) 


Hence, the corresponding fundamental matrix is 


cost —sint 
sint cost 


and 


—sint cost 


x=( 
x= ( cost 2) 


Consider now the ODE 
xv’ = A(t)x+ Bit) 


bp (t) 
= cost —sint cost sint by (t) c 
= sint cost }. —sint cost by (t) 
cost —sint by; (t) cost + by (t) sint dt 
sint cost }. —b; (t) sint + bg (t) cost : 
Consider a particular example B (t) = ( “t f ) Then the integral is 


cost —tsint tcost + C; 
d dt = . ; 
—sint —tcost —tsint + C2 


oe cost —sint tcost + C; 
ye sint cost —tsint+ Cy 
C cost — Cosint+t 

C sint + Cy cost 


—sint 
) +Cr( cost ) ; 


where B (t) = ( Dit) ) By (3.46), we obtain the general solution 


whence 


II II 
Og, 
oc 
Ne 
+ 
Q 
iis 
on 
sao 
Cho 


3.6.2. A scalar ODE of n-th order 
Consider now a scalar ODE of order n 
ao) + a, (t)a®") +. tan (t) a2 = f (2), 


where a; (t) and f (t) are continuous functions on some interval J. Recall that it can be 
reduced to the vector ODE 
x’ = A(t)x+ Bit) 
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where 


/ 
x(ty={ 7% 
gp (nl) (t) 
and 
0 1 0 0 0 
0 0 1 0 0 
A= mr ie oe x2 and B= 
0 0 0 ve UL ie 
An An-1 An-2 «+. TA, f 


If %1,...,U%, are n linearly independent solutions to the homogeneous ODE 
a™ + aye) +... +n (t)a =0 


then denoting by x1, ...,X, the corresponding vector solution, we obtain the fundamental 
matrix 


Ly 2 eee Zn 

zi, x Be 0h 

X= (x1 | x2] [n=] UP OF 
of") af) a gD 


We need to multiply X~' by B. Denote by y;, the element of X~! at position i,k 
where 7 is the row index and k is the column index. Denote also by y, the k-th column 


Yk 
of X—1, that is, y, = .. |. Then 
Unk 
; Yu +. Yin 0 Yint 
XVB= ie at) ae 2 = $y = Ua: 


and the general vector solution is 
x= X(t) | F(t) (at. 


We need the function x (t) which is the first component of x. Therefore, we need only to 
take the first row of X to multiply by the column vector [ f (t) yn (t) dt, whence 


r= doy f F(t) yin ee 
j=l ‘ 
Hence, we have proved the following. 
Corollary. Let x1,...,%» be n linearly independent solution to 


a™ 4 a, (tha) +... tan (ft) =0 


and X be the corresponding fundamental matriz. Then, for any continuous function f (t), 
the general solution to the ODE 


a™ + ay (t) ge AY ae ose (t)x = f(t) 
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is given by . 
r= S20 f FO un Oat (3.49) 


where yj, are the entries of the matrix X~. 


Example. Consider the ODE 
go +2 =sint 


The independent solutions are x (t) = cost and 2 (t) = sint, so that 
i cos t sint 
—sint cost 


xe cost —sint 
~ \ sint cost 


The inverse is 


Hence, the solution is 


or) = att) [FO me (ats x2) f F(t) yea at 


= cost [ sint (- sin t) at+sint [ sint costa 


1 
= -—cost / sin? tdt + 3 sint / sin 2tdt 


1 1 1 
—cost (5 — sin 2t + C1) a qsint(- cos 2t + C2) 


1 1 
= ~atcost + Z (sin 2t cost — sint cos 2t) + C3 cost + Cy sint 


1 
= pt COREE C3 cost + Cs sin t. 


Of course, the same result can be obtained by Theorem 3.8. 
Consider one more example, when the right hand side is not a quasi-polynomial: 


x" +a = tant. (3.50) 


Then as above we obtain® 


x 


cost / tant (—sint) dt + sint / tan t cos tdt 


1—sint 
n (=) + sint) — sintcost + C, cost + Co sint 


lI 

fo) 

ie) 

nN 

on 
gor 
oo ee 

es 


1 1—sint 
5 cose ln (as) + Ci cost + Cg sint. 


’The intergal f tan x sin tdt is taken as follows: 


» 2 9: 
s ib 1 — cost dt 
J tencesin tdt = / on dt = / ee dt = / sin t. 
cost cost cost 
/ dt /=> / dsint 1, 1—sint 
= = =—ln ; 
cost cos? t 1—sin?t 2 1+sint 
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Next, we have 


Let us show how one can use the method of variation of parameters directly, without 
using the formula (3.49). Consider the ODE 


ve +a= f(t). (3.51) 
The general solution to the homogeneous ODE x” + x = 0 is 
x(t) = Ci cos +C} sint, (3.52) 


where C; and C are constant parameters. let us look for the solution of (3.50) in the 
form 
x (t) = C; (t) cost + C. (t) sint, (3.53) 


which is obtained from (3.52) by replacing the constants by functions (hence, the name 
of the method “variation of parameters”). To obtain the equations for the unknown 
functions C; (t) , C2 (t), differentiate (3.53): 
x(t) = —C;,(t)sint + C2 (t) cost (3.54) 
+C} (t) cost + C} (t) sint. 
The first equation for C;,C2 comes from the requirement that the second line here (that 
is, the sum of the terms with Ci}, and C4) must vanish, that is, 


Ci cost + Cysint = 0. (3.55) 


The motivation for this choice is as follows. Switching to the normal system, one must 
have the identity 
x(t) = Ci (t) x1 (t) + Coxe (t), 


which componentwise is 


t) = Cy (t)cost + C(t) sint 
t) = C(t) (cost)’ + Co (t) (sint)’. 


x ( 
et 
Differentiating the first line and subtracting the second line, we obtain (3.55). 
It follows from (3.54) and (3.55) that 
ge” = —Ccost — Cysint 
—C{ sint + C4 cost, 
whence 
xv" +x =—C)sint+ Chcost 


(note that the terms with C; and C2 cancel out and that this will always be the case 
provided all computations are done correctly). Hence, the second equation for C} and C4, 
is 

—C{sint + Chcost = f(t), 


Solving the system of linear algebraic equations 
Ci cost + Chsint = 0 
—Cisint+ Chcost = f(t) ’ 
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we obtain 
C, = —f (t)sint, Ci = f (t) cost 
whence 
Cy= > [i (t)sintdt, Cy= ie, (t) cos tdt 
and 


“(t) =.= cost / f (t) sintdt + sint / f (t) cos tdt. 


3.7 Wronskian and the Liouville formula 


Let J be an open interval in R. 


Definition. Given a sequence of n vector functions 71,...,%, : [ — IR”, define their 
Wronskian W (t) as a real valued function on J by 


W (t) = det (0 (1) | #2(t) |..-| an (t)), 


where the matrix on the right hand side is formed by the column-vectors 71, ..., %,. Hence, 
W (t) is the determinant of the n x n matrix. 


Definition. Let 71, ...,%, are n real-valued functions on J, which are n — 1 times differ- 
entiable on J.. Then their Wronskian is defined by 


Lemma 3.11 (a) Let %1,...,%, be a sequence of R"-valued functions that solve a linear 
system x! = A(t)ax, and let W (t) be their Wronskian. Then either W (t) = 0 for all 
t € I and the functions x1,...,%, are linearly dependent or W (t) £0 for allt € I and the 
functions ©1,...,%», are linearly independent. 

(b) Let 11,...,2%p be a sequence of real-valued functions that solve a linear system ODE 


o™ + a (t)2) +... +a, (t) 2 = 0, 


and let W (t) be their Wronskian. Then either W (t) = 0 for allt € I and the functions 
L1,+..,Ln are linearly dependent or W (t) £0 for allt € I and the functions x1, ...,U%p are 
linearly independent. 


Proof. (a) Indeed, if the functions 71, ...,%, are linearly independent then, by Lemma 
3.9, the vectors 2 (t),...,@n (t) are linearly independent for any value of t, which im- 
plies W (t) £ 0. If the functions 2,...,2, are linearly dependent then also the vectors 


x1 (t),...,2n (t) are linearly dependent for any t, whence W (t) = 0. 
(b) Define the vector function 
Xk 
Xk = oh 
xt 
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so that x1, ...,.x, is the sequence of vector functions that solve a vector ODE x’ = A(t) x. 
The Wronskian of x1, ...,X, is obviously the same as the Wronskian of 71, ...,%,, and the 
sequence Xj,...,X, is linearly independent if and only so is 7%1,...,%,. Hence, the rest 
follows from part (a). @ 


Theorem 3.12 (The Liouville formula) Let {2;};, be a sequence of n solutions of the 
ODE «' = A(t)x, where A: I > R"*” is continuous. Then the Wronskian W (t) of this 
sequence satisfies the identity 


W (t) = W (to) exp (i (T) ir) ‘ (3.56) 


J to 


for all t,to € I. 


Recall that the trace (Spur) trace A of the matrix A is the sum of all the diagonal 
entries of the matrix. 

Proof. Let the entries of the matrix (x1| 29|.../a,) be x;; where 7 is the row index and 
j is the column index; in particular, the components of the vector x; are ©1;, 2;, ...,Unj- 
Denote by r; the i-th row of this matrix, that is, 7; = (%j1, Vig, ..., Lin); then 


We use the following formula for differentiation of the determinant, which follows from 
the full expansion of the determinant and the product rule: 


a Ty r1 
/ 
W' (t)=det} "2 | +det} "2 | +...4det| ” (3.57) 
Tn Tn i, 


Indeed, if fi (t),..., fn (t) are real-valued differentiable functions then the product rule 
implies by induction 


(fisiefn). = fiiedn at tidedn et fips, 


Hence, when differentiating the full expansion of the determinant, each term of the de- 
terminant gives rise to n terms where one of the multiples is replaced by its derivative. 
Combining properly all such terms, we obtain that the derivative of the determinant is 
the sum of n determinants where one of the rows is replaced by its derivative, that is, 
(3.57). 

The fact that each vector «; satisfies the equation x, = Ax; can be written in the 
coordinate form as follows 


k=1 
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For any fixed i, the sequence {2;; y is nothing other than the components of the row 
r;. Since the coefficients Aj; do not depend on j, (3.58) implies the same identity for the 


TOws: a 
! > 
i = Agel k , 
k=1 


That is, the derivative 1’, of the i-th row is a linear combination of all rows r;. For example, 
ry = Aura + Arro +... + Aintn 
which implies that 


* TY T2 Tn 
det "2 = Ay det re + Aj det i Sees Ain det mS 


All the determinants except for the 1st one vanish since they have equal rows. Hence, 


1 T1 
det fe => Ay det a = AiyW (t) . 
Tn ln 


Evaluating similarly the other terms in (3.57), we obtain 
W' (t) = (Aur + Ago +... + Ann) W (t) = (trace A) W (t). 


By Lemma 3.11, W (t) is either identical 0 or never zero. In the first case there is nothing 
to prove. In the second case, we can solve the above ODE using the method of separation 
of variables. Indeed, dividing it W (t) and integrating in t, we obtain 


Wt) _ f° 
”WUin) -[ trace A (rT) dr 


(note that W (t) and W (to) have the same sign so that the argument of In is positive), 
whence (3.56) follows. m= 


Corollary. Consider a scalar ODE 
t™ + a(t) +... +a, (t) 2 = 0, 


where a, (t) are continuous functions on an interval I C R. If x1 (t),...,¢n(t) are n 
solutions to this equation then their Wronskian W (t) satisfies the identity 


W (t) = W (to) exp (- J ay (T) ir) (3.59) 


Proof. The scalar ODE is equivalent to the normal system x’ = Ax where 


0 1 0 aa 0 d 

0 0 i. 2 YO - 
A= a and x= 

0 0 0 1 (net) 

an an-1 An—2 ay 


Since the Wronskian of the normal system coincides with W (t), (3.59) follows from (3.56) 
because trace A = —a,. @ 
In the case of the ODE of the 2nd order 


a” +a; (t)a' +a. (t)x =0 


the Liouville formula can help in finding the general solution if a particular solution is 
known. Indeed, if 29 (t) is a particular non-zero solution and x (t) is any other solution 
then we have by (3.59) 


ro «x 
det (7  ) =Cem(- faa), 


Tor’ — xx = Cexp (- fos (t) it) . 


that is 


Using the identity 


we obtain the ODE at. he (- fa it) 
(=) = ————_—_,——_., (3.60) 
Lo Xo 
and by integrating it we obtain = and, hence, « (cf. Exercise 35). 
Example. Consider the ODE 


gg" —2 (1 + tan’ t) xz = 0. 
One solution can be guessed 2 (t) = tant using the fact that 


am i : tan?t+1 
—tant= = tan 
dt cos? t 


and 
2 


d 
qa vant = 2tant (tan? ¢ + 1) ; 


Hence, for x (t) we obtain from (3.60) 
( zt .; 2, aS 
tant/ tan? t 


dt 
r= Ctant | = Ctant (—t — cott+ C}). 


whence? 


Renaming the constants, we obtain the answer 


x(t) = C; (ttant + 1) +C) tant. 


°To evaluate the integral { “5, = f cot? tdt use the identity 
(cot t)’ = —cot?#-1 
that yields 
[cot tdt = -—t—cott+C. 


Ot 


3.8 Linear homogeneous systems with constant coefficients 


Here we will be concerned with finding the general solution to linear systems of the form 
x’ = Ax where A € C”*” is a constant n x n matrix with complex entries and x(t) is a 
function from R to C”. As we know, it suffices to find n linearly independent solutions 
and then take their linear combination. We start with a simple observation. Let us try 
to find a solution in the form x = ev where v is a non-zero vector in C” that does not 
depend on t. Then the equation x’ = Ax becomes 


ey = e* Av 


that is, Av = Av. Recall that any non-zero vector v that satisfies the identity Av = Av 
for some constant . is called an eigenvector of A, and X is called the eigenvalue. Hence, 
the function x (t) = ev is a non-trivial solution to x! = Ax provided v is an eigenvector 
of A and 4 is the corresponding eigenvalue. 

The fact that is an eigenvalue means that the matrix A— Aid is not invertible, that 
is, 

det (A — Aid) = 0. (3.61) 

This equation is called the characteristic equation of the matrix A and can be used to 
determine the eigenvalues. Then the eigenvector is determined from the equation 


(A— did) v =0. (3.62) 


Note that the eigenvector is not unique; for example, if v is an eigenvector then cv is also 
an eigenvector for any constant c. 
The function 
P(A) := det (A — Aid) 


is clearly a polynomial of of order n. It is called the characteristic polynomial of the 
matrix A. Hence, the eigenvalues of A are the root of the characteristic polynomial P (A). 


Lemma 3.13 [fan xn matrix A has n linearly independent eigenvectors V1, ...,Un with 
the (complex) eigenvalues \1,...,An then the general complex solution of the ODE a! = Ax 
is given by 


a(iy= > Cee, (3.63) 
k=1 


where C',...,C, are arbitrary complex constants.. _ 
If A is a real matrix and X is a non-real eigenvalue of A with an eigenvector v then A 
is an eigenvalue with eigenvector DU, and the terms ev, eD in (3.63) can be replaced by 


the couple Re (ev), Im (ev). 


Proof. As we have seen already, each function e**'v, is a solution. Since vectors 


{v~},—, are linearly independent, the functions fertug) are linearly independent, 
whence the first claim follows from Theorem 3.1. 

If Av = Av then applying the complex conjugation and using the fact the entries of 
A are real, we obtain Ad = \v so that \ is an eigenvalue with eigenvector 0. Since the 


functions ev and eG are solutions, their linear combinations 


erty + eG a4 erty — rT 
——— and Ime“v = ————_ 


R At = 
soi 2 i 
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are also solutions. Since e’v and e*’U can also be expressed via these solutions: 
ety = Ree“v+ilme™y 
eG = Ree“v—ilme™y, 


replacing in (3.63) the terms e™’,e*’ by the couple Re (e*'v), Im (ev) does not change 
the set of functions, which finishes the proof. m 

It is known from Linear Algebra that if A has n distinct eigenvalues then their eigen- 
vectors are automatically linearly independent, and Lemma 3.13 applies. Or if A is a 
symmetric matrix then there is a basis of eigenvectors, and Lemma 3.13 applies. 


ey 
y =a" 


The vector form of this system is x = Ax where A = ( 


Example. Consider the system 


0 1 
1 0 


P(A) = det ( {ao 


) . The characteristic 


polynomial is 


the characteristic equation is \*7 — 1 = 0, whence the eigenvalues are \; = 1, A» = —1. 
For \ = A; = 1 we obtain the equation (3.62) for v = (§): 


aes 


which gives only one independent equation a — b = 0. Choosing a = 1, we obtain b = 1 


whence 
ee 1 
Lelong, A) 
Similarly, for \ = Ap = —1 we have the equation for v = (°) 
11 a 
(i)G)-° 
which amounts to a+ b=0. Hence, the eigenvector for 42 = —1 is 


_ 1 
Vg = ={ 7 
Since the vectors v; and v2 are independent, we obtain the general solution in the form 
1 1 Cre + Coe 
_ t -t = 1 2 
x(t) =Che (1 ) Gre Lee ; 


that is, x(t) = Cie’ + Coe and y (t) = Cye’ — Coe. 


Example. Consider the system 


0 


The matrix of the system is A = ( 1 


fi ) , and the the characteristic polynomial is 


P(A) = det ( 7 Be rar 


Hence, the characteristic equation is \7+1 = 0 whence Ay = iand Ay = —i. For\ = Ay =i 


we obtain the equation for the eigenvector v = () 


2)(3)-« 


which amounts to the single equation ia+6b = 0. Choosing a = 7, we obtain b = 1, whence 


(1) 


and the corresponding solution of the ODE is 


ee: eit t-\ of sit +t cost 
ENA te ‘oy aa cost +isint j 
Since this solution is complex, we obtain the general solution using the second claim of 
Lemma 3.13: 


_ _ —sint cost \ — f —Cisint + C2 cost 
x(t) = Cr Rex: + C2Imxy =61( cost )+e ( sint ) i ( C, cost + Cy sint i 


Example. Consider a normal system 


ey 

g= 0; 
This system is trivially solved to obtain y = C, and x = Cyt + C,. However, if we 
try to solve it using the above method, we fail. Indeed, the matrix of the system is 


A= ( : ; ) , the characteristic polynomial is 


P(A) = det ( ee = ) =e 


and the characteristic equation P (\) = 0 yields only one eigenvalue \ = 0. The eigenvec- 


a 


tor v= ( if satisfies the equation 


whence b = 0. That is, the only eigenvector (up to a constant multiple) is v = ( ; ) . 


and the only solution we obtain in this way is x(t) = . The problem lies in the 


1 
0 
properties of this matrix — it does not have a basis of eigenvectors, which is needed for 
this method. 


In order to handle such cases, we use a different approach. 
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3.8.1 Functions of operators and matrices 


Recall that an scalar ODE x’ = Az has a solution x(t) = Ce“‘t. Now if Aisanxn 
matrix, we may be able to use this formula if we define what is e4’. It suffices to define 
what is e4 for any matrix A. It is convenient to do this for linear operators acting in C”. 

Recall that a linear operator in C” is a mapping A : C” — C” such that, for all 
z,yEeEC" andr €C, 


A(z+ty) = Axr+Ay 
A(Az) = Az. 


Any n x n matrix defines a linear operator in C” using multiplication of column-vectors 
by this matrix. Moreover, any linear operator can be represented in this form so that 
there is an one-to-one correspondence!” between linear operators and matrices. 
Denote the family of all linear operators in C” by £ (C"”). For any two operators A, B, 
define their sum A+ B by 
(A B)ae= Avs Bx 


and the product by a scalar \ € C by 
(AA) (a) = AAz, 


for all x € C”. With these operation, £ (C”) is a linear space over C. Since any operator 
can be identified with a n x n matrix, the dimension of the linear space £L(C”) is n?. 
Apart from the linear structure, the product AB of operators is defined in £(C") as 
composition that is, 
(AB eS Abr). 


Fix a norm || - || in C”, for example, the oo-norm 


al|so:t= naa: [a | 
1<k<n 


where 71, ...,%, are the components of the vector x. Define the associated operator norm 
in £(C”) by 
|| Aa 


xreC”\{0} Kal 


|| All = (3.64) 
Claim. The operator norm is a norm in £L(C”). 

Proof. Let us first show that ||A|| is finite. Represent A is a matrix (A,;) in the 
standard basis. Since all norms in any finitely dimensional linear space are equivalent, we 
can assume in the sequel that ||x|| = |||... Then 


||Acloo = max|(A:r),| = max 


Se Aingts 
j 


IA 


max max |xj| = C'|2l00, 
J 


> Ags 
J 


where C’ < oo. Therefore, ||A|]| < C < 00. 


10This correspondence depends on the choice of a basis in C”. 
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2. Clearly, || A|| > 0. Let us show that ||A|| > 0 if A 4 0. Indeed, if A 4 0 then there 
is x € C” such that Ax ¥ 0 and || Az|| > 0, whence 


All > || Ax 
~ [ell 


3. Let us prove the triangle inequality: ||A + B|| < ||A]| + ||B]]. Indeed, by definition 
(3.64) 


> 0. 


(A+ Ball — HAs + IB 


|A + B|| = sup < 
a ||| 2 || 
A B 
<oupll4tlh guy Bel 
x ||2| x ||| 
= |All] + |B Il. 


4. Let us prove the scaling property: ||AA|| = |A| || Al] for any A € C. Indeed, by (3.64) 


AA) x A| || Ax 
HOA) ALAA yy ay 


|x| al 


|| AAl| = sup 


rT] 
In addition to the general properties of a norm, the operator norm satisfies the in- 
equality 
|| ABI] < |All BI. (3.65) 


Indeed, it follows from (3.64) that || Az|| < ||A]| ||a]| whence 
(AB) 2|| = ||A(Be)|| < |All Bel] < |All BT lel 


which yields (3.65). 

Hence, £(C”) is a normed linear space. Since this space is finite dimensional, it is 
complete as a normed space. As in any complete normed linear space, one can define in 
L£(C") the notion of the limit of a sequence of operators. Namely, we say that a sequence 
{ A; } of operators converges to an operator A if 


||A, — Al| — 0 as k — oo. 


Representing an operator A as a matrix (Aij)i j-1) one can consider the oo-norm on 
operators defined by 

Alloo = max |A;,|. 

|Alloo = max [Ais 


Clearly, the convergence in the oo-norm is equivalent to the convergence of each component 
Aj; separately. Since all norms in £(C”) are equivalent, we see that convergence of a 
sequence of operators in any norm is equivalent to the convergence of the individual 
components of the operators. 

Given a series }*7°, A; of operators, the sum of the series is defined as the limit of 
the sequence of partial sums )~’_, Ay as N — 00. That is, S = 57%, As if 


N 
IS — S> Axl] 0 as N > oo. 


k=1 
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Claim. Assume that 


S| Agl| < 00. (3.66) 
k=1 


Then the series S~7°, Ap converges. 

Proof. Indeed, since all norms in £ (C”) are equivalent, we can assume that the norm 
in (3.66) is the oo-norm. Denoting by (A;),,; the 7j-components of the matrix A, we obtain 
that then the condition (3.66) is equivalent to 


a (Ae), 


k=1 


< 00 (3.67) 


for any indices 1 < i,j <n. Then (3.67) implies that the numerical series 


converges, which implies that the operator series S*7°., A; also converges. & 


If the condition (3.66) is satisfied then the series )\7~_, Ax is called absolutely convergent. 
Hence, the above Claim means that absolute convergence of an operator series implies 
the usual convergence. 


Definition. If A € £(C”) then define e“ € L(C”) by means of the identity 


2 Ak oo Ak 
Fe cane aie eis Reo (3.68) 


where id is the identity operator. 


Of course, in order to justify this definition, we need to verify the convergence of the 
series (3.68). 


Lemma 3.14 The exponential series (3.68) converges for any A € £(C"). 


Proof. It suffices to show that the series converges absolutely, that is, 
k=0 
It follows from (3.65) that |) A*|| < ||A||* whence 


lo) 
k=0 


< Ow. 


Ak 
kl 


k 
< < = SIAN 25 


= |All" 
2. i 


and the claim follows. 


Theorem 3.15 For any A € £L(C") the function F (t) = e' satisfies the ODE F' = AF. 
Consequently, the general solution of the ODE x' = Ax is given by x = e'4v where v € C” 
is an arbitrary vector. 
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Here x = x(t) is as usually a C”-valued function on R, while F(t) is an £(C")- 
valued function on R. Since £(C") is linearly isomorphic to C”’, we can also say that 
F (t) is a C”’-valued function on R, which allows to understand the ODE F’ = AF in 
the same sense as general vectors ODE. The novelty here is that we regard A € £(C”) 
as an operator in £(C") (that is, an element of £(£(C"”))) by means of the operator 
multiplication. 

Proof. We have by definition 


Consider the series of the derivatives: 


oo) d tk Ak oD tk-L 4k £0 pk-1 Ak-l 
t = ae = = ——“—“ = A a AF. 
60= ba (Gr) "Lab ea 
k=0 k=1 k=1 
It is easy to see (in the same way as Lemma 3.14) that this series converges locally 
uniformly in t, which implies that F is differentiable in t and F’ = G. It follows that 
F’= AF. 


For function x (t) = e'4 


v, we have 


so that x (t) solves the ODE 2’ = Az for any v. 
If w(t) is any solution to x’ = Ax then set v = x (0) and observe that the function 


Ay satisfies the same ODE and the initial condition 


et 
e4y|,-0 = idu = v. 


Hence, both x(t) and e’4v solve the same initial value problem, whence the identity 
x (t) = e'4v follows by the uniqueness theorem. m 


Remark. If vj, ..., v; are linearly independent vectors in C” then the solutions ey, ...., e’Up, 


are also linearly independent and, hence, can be used to form the fundamental matrix. 
In particular, choosing v),...,Un to be the canonical basis in C”, we obtain that e’4v, is 
the k-th column of the matrix e’“. Hence, the matrix e’4 is itself a fundamental matrix 
of the system 2’ = Ax. 


Example. Let A be the diagonal matrix 


A = diag (Ai, ..., An) - 


Then 
AY = diag (tio 8) 
and 
et4 — diag (ens er em") : 
Let 


nes, 
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Then A? = 0 and all higher power of A are also 0 and we obtain 


$A os _ 1 ¢ 
€ =id4ta=( 5 i): 


Hence, the general solution to 2’ = Ax is 


agers (in) kee) 


where C, Cy are the components of v. 


Definition. Operators A,B € £(C") are said to commute if AB = BA. 


In general, the operators do not have to commute. If A and B commute then various 
nice formulas take places, for example, 


(A+B) = A? +2AB + B?. (3.69) 
Indeed, in general we have 
(A+ BY =(A4+ B)(A+ B) =A? +AB+ BA+ B’, 
which yields (3.69) if AB = BA. 


Lemma 3.16 Jf A and B commute then 


Proof. Let us prove a sequence of claims. 


Claim 1. [f A,B,C commute pairwise then so do AC and B. 
Indeed, 


(AC) B = A(CB) = A(BC) = (AB)C =(BA)C = B(AC). 


Claim 2. [f A and B commute then so do e4 and B. 
Indeed, it follows from Claim 1 that A* and B commute for any natural k, whence 


Claim 3. If A(t) and B(t) are differentiable functions from R to £(C") then 
(A(t) B(t))’ = A’ (t) B(t) + A(t) B’ (t). (3.70) 


Warning: watch the correct order of the multiples. 
Indeed, we have for any component 


/ 
(AB);, = > Aub) = So Ap Buyt)_ An Br, = (A’B),,+(AB’),, = (A’B ey, AB’),; ’ 
k k k 
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whence (3.70) follows. 
Now we can finish the proof of the lemma. Consider the function F' : R — £(C") 
defined by 
F (t) = etA (tf Be 


Differentiating it using Theorem 3.15, Claims 2 and 3, we obtain 
F’ (t) = (Ay oP re (ey = Act etB 4 tA Bel® = Act4et? + BetAe® = (A+B) F(t). 


On the other hand, by Theorem 3.15, the function G(t) = e’4+®) satisfies the same 
equation 
G' =(A+B)G. 

Since G'(0) = F'(0) = id, we obtain that the vector functions F(t) and G'(t) solve the 
same IVP, whence by the uniqueness theorem they are identically equal. In particular, 
F (1) = G(1), which means e4e? = e4+?. 

Alternative proof. Let us briefly discuss a direct algebraic proof of e 
One first proves the binomial formula 


(A+B)"= 3 @ aR 


k=0 


A+B _ (AB 


using the fact that A and B commute (this can be done by induction in the same way as 
for numbers). Then we have 


(A+B) Gwe AtB 
aia nl gs ca 


and, using the Cauchy product formula, 


Of course, one need to justify the Cauchy product formula for absolutely convergent series 
of operators. # 
3.8.2 Jordan cells 


Here we show how to compute e“ provided A is a Jordan cell. 


Definition. An n x n matrix J is called a Jordan cell if it has the form 


Ae’ cal 0 --- O 
0 A 
A=| : 0 |. (3.71) 
: vA 1 
0 0 dA 


where A is any complex number. 
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Here all the entries on the main diagonal are \ and all the entries just above the main 
diagonal are 1 (and all other values are 0). Let us use Lemma 3.16 in order to evaluate 
e'A where A is a Jordan cell. Clearly, we have A = \id +N where 


0 1 O -:- O 


A matrix (3.72) is called a nilpotent Jordan cell. Since the matrices \ id and N commute 
(because id commutes with anything), Lemma 3.16 yields 


eA Zs elrid tN s elretN (3.73) 


Hence, we need to evaluate e’’, and for that we first evaluate the powers N?, N?, etc. 
Observe that the components of matrix N are as follows 


Le type 
Nay = { 0, otherwise 


v] 


where 7 is the row index and 7 is the column index. It follows that 


- 1, ifj=i+2 
("=o Mae ={ 
k=1 


otherwise 
that is, 
0 O 1 0 
2 
N°= 1 
: . 0 
(teak. eles Hae - 0) 


Here the entries with value 1 are located on the diagonal that is two positions above the 
main diagonal. Similarly, we obtain 


0 1 0 

k 
ie 1 
Gime oatk~ doe 1H 


where the entries with value 1 are located on the diagonal that is k positions above the 
main diagonal, provided k <n, and N* = 0 if k > n. 

Any matrix A with the property that A* = 0 for some natural k is called nilpotent. 
Hence, N is a nilpotent matrix, which explains the term “a nilpotent Jordan cell”. It 
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follows that 


2 “ n—1 
Lc Soe can 
tN t aoe (ee 1 fe et ae ee 
Sad tN PN Hb et N= a 23 : . . (3.74 
e 1 7 I HoT a Pes Oe a (3.74) 
7 
Gi Sit are iT 
Combining with (3.73), we obtain the following statement. 
Lemma 3.17 If A is a Jordan cell (3.71) then, for anyt € R, 
ext £ lA a etd os eld 
0 el 4 tr a 
eA = : .| om .) t? tr . (3.75) 
: P : . ae 
r+, tet 
0 0 en 


By Lemma 3.15, the general solution of the system x’ = Az is x (t) = e’4v where v is 


an arbitrary vector from C”. Setting v = (C\,...,C;,), we obtain that the general solution 
is 


x(t)=Cyi +..+Cprn, 


where 21,...,2, are the columns of the matrix e’4 (which form a sequence of n linearly 
independent solutions). Using (3.75), we obtain 


ai tySe™ (1:0;.2,0) 


t 

ote (5 1,0, 0) 
ft 

a (t) = e™ € =e 1,0, 0) 


Alias ti 


3.8.3 Jordan normal form 


Definition. If A is am x m matrix and B is al x | matrix then their tensor product is 
ann xX n matrix C’ where n = m+1 and 


Alo 
aan ORES 


That is, matrix C’ consists of two blocks A and B located on the main diagonal, and all 
other terms are 0. 


Notation for the tensor product: C = A® B. 
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Lemma 3.18 The following identity is true: 
e488 — @A @e®. (3.76) 


In extended notation, (3.76) means that 


as 
= 7 
ae 


Proof. Observe first that if A,;, A2 are m x m matrices and B,, By are | x | matrices 
then 


(A; @ B,) (Ag ® Bg) = (Ai Az) ® (Bi By) . (3.77) 
Indeed, in the extended form this identity means 
A, | 0 | A2 0 | A;A.| 0 | 
0 |B, | 9 Bo |} 0 By Bz | 


which follows easily from the rule of multiplication of matrices. Hence, the tensor product 
commutes with the matrix multiplication. It is also obvious that the tensor product 
commutes with addition of matrices and taking limits. Therefore, we obtain 


oe) k oo k k oo k Sad k 

A@B __ (A@ B) _ A @B = A B Lf B 

Se ae pa Oe a ore 
k=0 k=0 k k=0 

|| 


Definition. A tensor product of a finite number of Jordan cells is called a Jordan normal 
form. 


That is, if a Jordan normal form is a matrix as follows: 
Ji 
Jy 0 
JI,O 28: OA = ce 


where J; are Jordan cells. 
Lemmas 3.17 and 3.18 allow to evaluate e'4 if A is a Jordan normal form. 


Example. Solve the system x’ = Ax where 


oOo KF eK 
ON OO 
| NO a DD) 


Clearly, the matrix A is the tensor product of two Jordan cells: 


1 il 2 1 
A=(4 i) and b= (5 a 
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By Lemma 3.17, we obtain 


e te’ 0 O 

wa_{ 0 & 0 0 
1! Oh De se te 
0 0 0 e 


The columns of this matrix form 4 linearly independent solutions 


C= Oe) 
2 = (te’, et , 0,0) 
t3 = (0,0,e”,0) 
wm = (0, 0, te”, e =“) 


and the general solution is 


x (t) = Cyr, + Core + C3xr3 + Cyx4 
(Cie’ + Cote’, Coe’, Gre" + Cyte”*, Ce”) . 


3.8.4 Transformation of an operator to a Jordan normal form 


Given a basis b = {bj, bo,...,b,} in C” and a vector x € C”, denote by x? the column 
vector that represents x in this basis. That is, if x? is the i-th component of x? then 


a= 2b) +abbo +... +2°d, = So why. 
i=1 


Similarly, if A is a linear operator in C” then denote by A? the matrix that represents A 
in the basis 6. It is determined by the identity 


(Ax)? = Abs’, 


which should be true for all x € C”, where in the right hand side we have the product of 
the n x n matrix A® and the column-vector x?. 

Clearly, (b;)’ = (0, ...1,...0) where 1 is at position i, which implies that (Ab;)’ = A? (b;)° 
is the i-th column of A’. In other words, we have the identity 


A? = ((Abi)? | (Aba)’ | «+ | (Abn)*), 
that can be stated as the following rule: 


the i-th column of A° is the column vector Ab; written in the basis by,..., by. 
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Example. Consider the operator A in C? that is given in the canonical basis e = {e1, e2} 


by the matrix 
ge EOP of 
a(02), 


Consider another basis b = {b1, b2} defined by 


h=a-a-(7,) and b=ata=({ J. 
are(29)()209 
o-(22)(2)-() 


It follows that Ab; = —b; and Aby = by whence 
ae oe 
= (2°). 

The following theorem is proved in Linear Algebra courses. 
Theorem. For any operator A € £L(C") there is a basis b in C” such that the matrix A® 
is a Jordan normal form. 

Let J be a Jordan cell of A® with \ on the diagonal and suppose that the rows (and 
columns) of J in A? are indexed by j,j +1,...,7 +p—1so0 that J isa px p matrix. Then 


the sequence of vectors ;, ..., bj+p)-1 is referred to as the Jordan chain of the given Jordan 
cell. In particular, the basis 6 is the disjoint union of the Jordan chains. 


Then 


and 


Since 
Goo jtpel 
L L 
0 1 0 -j 
Are = : 
: ee ee UL * 
i: wee” i. | j+p-1 


and the k-th column of A’ — Aid is the vector (A — id) by written in the basis b, we 
conclude that 

(A — rid) b; =0 

(A — Aid) bj41 = 8; 

(A=Nid) bps = ba 


(A — Nid) bj4p—1 = bj 4-2. 


111 


In particular, b; is an eigenvector of A with the eigenvalue A. The vectors bj+1,...,bj4p-1 
are called the generalized eigenvectors of A (more precisely, b;,1 is the 1st generalized 
eigenvector, bj;;2 is the second generalized eigenvector, etc.). Hence, any Jordan chain 
contains exactly one eigenvector and the rest vectors are the generalized eigenvectors. 


Theorem 3.19 Consider the system x' = Ax with a constant linear operator A and let 
A® be the Jordan normal form of A. Then each Jordan cell J of A® of dimension p with 
Xd on the diagonal gives rise to p linearly independent solutions as follows: 


a, (t) =e%vy, 


t 
w(t) =e (Fa + w») 
i t 
x3 (t) =e (5a aia UT ws) 


2! 1! 


At Po | | t | 
te Se (p iyi bot 7pUp-i t+ Up |, 


where {v1,...,Up} is the Jordan chain of J. The set of all n solutions obtained across all 
Jordan cells is linearly independent. 


Proof. In the basis b, we have by Lemmas 3.17 and 3.18 


At t tr thr Str 
e Tre (pair 
LA® _ 0) et 
€ = ’ 
ne t utr 
. Tre 
0 0 tr 


where the block in the middle is e'7. By Lemma 3.15, the columns of this matrix give 
n linearly independent solutions to the ODE x’ = Ax. Out of these solutions, select p 
solutions that correspond to p columns of the cell e“”, that is, 


SAA eocel Cea ua eeeced | eee 


a ee 
Pp 
ble SOS 00.202.) 
—_—_—_———- 
Pp 
-1 
Up (t) = ( : eyes ese) te™, en ie 
ee 
Pp 


where all the vectors are written in the basis b, the horizontal braces mark the columns of 
the cell J, and all the terms outside the horizontal braces are zeros. Representing these 
vectors in the coordinateless form via the Jordan chain 2, ..., Up, we obtain the solutions 
as in the statement of Theorem 3.19. m 

Let A be an eigenvalue of an operator A. Denote by m the algebraic multiplicity of 
A, that is, its multiplicity as a root of characteristic polynomial!’ P (A) = det (A — Aid). 
Denote by g the geometric multiplicity of A, that is the dimension of the eigenspace of .: 


g = dim ker (A — Aid). 


In other words, g is the maximal number of linearly independent eigenvectors of A. The 
numbers m and g can be characterized in terms of the Jordan normal form A?® of A as 
follows: m is the total number of occurrences of on the diagonal!” of A°, whereas g is 
equal to the number of the Jordan cells with \ on the diagonal!?. It follows that g < m 
and the equality occurs if and only if all the Jordan cells with the eigenvalue have 
dimension 1. 

Despite this relation to the Jordan normal form, m and g can be determined without 
a priori finding the Jordan normal form, as it is clear from the definitions of m and g. 


Theorem 3.19’ Let \ € C be an eigenvalue of an operator A with the algebraic multiplicity 
m and the geometric multiplicity g. Then gives rise to m linearly independent solutions 
of the system x’ = Ax that can be found in the form 


a(t) = e™ (uy + ust +... + ust* 1) (3.78) 


where s =m—g+1 and u; are vectors that can be determined by substituting the above 
function to the equation a’ = Ax. 

The set of all n solutions obtained in this way using all the eigenvalues of A is linearly 
independent. 


Remark. For practical use, one should substitute (3.78) into the system x’ = Ax con- 
sidering u;; as unknowns (where u,;; is the i-th component of the vector u;) and solve the 
resulting linear algebraic system with respect to u,;. The result will contain m arbitrary 
constants, and the solution in the form (3.78) will appear as a linear combination of m 
independent solutions. 


Proof. Let py, ..,p, be the dimensions of all the Jordan cells with the eigenvalue \ (as 
we know, the number of such cells is g). Then \ occurs p; + ... + pg times on the diagonal 
of the Jordan normal form, which implies 


g 
j=l 


'lTo compute P(A), one needs to write the operator A in some basis b as a matrix A, and then 
evaluate det (A, — Aid). The characteristic polynomial does not depend on the choice of basis b. Indeed, 
if b’ is another basis then the relation between the matrices A, and Ay is given by Ap = CAy C7! 
where C is the matrix of transformation of basis. It follows that Ay — \id = C (Ay — Aid) C7! whence 
det (Ap — Aid) = det C det (Ay — Aid) det C~! = det (Ay — Aid). 

!2Tf X occurs k times on the diagonal of A, then \ is a root of multiplicity k of the characteristic 
polynomial of A, that coincides with that of A. Hence, k = m. 

13Note that each Jordan cell correponds to exactly one eigenvector. 
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Hence, the total number of linearly independent solutions that are given by Theorem 3.19 
for the eigenvalue A is equal to m. Let us show that each of the solutions of Theorem 3.19 
has the form (3.78). Indeed, each solution of Theorem 3.19 is already in the form 


e® times a polynomial of t of degree < peal 


To ensure that these solutions can be represented in the form (3.78), we only need to 
verify that p; — 1< s—1. Indeed, we have 


d@-D= (s7») = tg = 5 ly 


j=l j=l 


whence the inequality p; — 1 < s—1 follows. m 

In particular, if m = g, that is, s = 1, then m independent solutions can be found in 
the form x(t) = ev, where v is one of m independent eigenvectors of \. This case has 
been already discussed above. Consider some examples, where g < m. 


Example. Solve the system 


The characteristic polynomial is 


2K 1 


P (A) = det (A ~ Aid) = det ( ear Mee 


) =¥-oa+9=0-39, 


and the only eigenvalue is A; = 3 with the algebraic multiplicity m; = 2. The equation 
for an eigenvector v is 


(A— Aid) v =0 


—-1 1 a 
Carta 
which is equivalent to —a +b = 0. Setting a = 1 and b = 1, we obtain the unique (up to 


a constant multiple) eigenvector 
1 
Uu= ( 1 ) ‘ 


Hence, the geometric multiplicity is g; = 1. Hence, there is only one Jordan cell with 
the eigenvalue ;, which allows to immediately determine the Jordan normal form of the 


given matrix: 
Seok 
(as a ae 


By Theorem 3.19, we obtain the solutions 


that is, for v = (a,), 


a(t) = ey 
a(t) = e* (tv, +2) 


where v2 is the 1st generalized eigenvector that can be determined from the equation 


(A - A id) Vg = Uj. 
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Setting v2 = (a,b), we obtain the equation 


ae 


which is equivalent to —a +b = 1. Hence, setting a = 0 and b = 1, we obtain 


wa 


whence 


Finally, the general solution is 


_ 34 f Cr+ Cat 
x(t) = Cia, + Cora =e Carer, ; 


Example. Solve the system 


2 leo «oh 
ge =| -2 0 -1 Ja 
2 1 2 
The characteristic polynomial is 
2-2 1 1 
P(A) = det (A-— did) = det —2 -rA -Il 


2 1 2-A 
= —\3+4+4)?—5442=(2-A)(A- 1). 


The roots are Ay = 2 with m; = 1 and A» = 1 with mz = 2. The eigenvectors v for A; are 
determined from the equation 


(A — Az id) v = 0, 
whence, for v = (a, b,c) 
0 1 1 a 
ae ee b | =0, 
2 1 O Cc 
that is, 
b+c=0 
—2a —2b—c=0 
2a+b=0. 
The second equation is a linear combination of the first and the last ones. Setting a = 1 
we find b = —2 and c = 2 so that the unique (up to a constant multiple) eigenvector is 
1 
v=| —2 |, 
2 


which gives the first solution 


x(t) =e* | —2 
2 


The eigenvectors for Ay = 1 satisfy the equation 
(A — Agid) v =0, 


whence, for v = (a,b,c), 


1 1 1 a 
a ese | b | =0, 
2 1 1 Cc 
whence 
a+b+c=0 
—2a—b—c=0 
2a+b+c=0. 
Solving the system, we obtain a unique (up to a constant multiple) solution a = 0, b = 1, 
c= —1. Hence, we obtain only one eigenvector 
0 
U= 1 
—l 


Therefore, go = 1, that is, there is only one Jordan cell with the eigenvalue 2, which 
implies that the Jordan normal form of the given matrix is as follows: 


2 0 0 
0 11 
0 0 1 


By Theorem 3.19, the cell with A2 = 1 gives rise to two more solutions 


and 
U3 (t) = et (tv; + V2) ; 


where v2 is the first generalized eigenvector to be determined from the equation 
(A = A2 id) Vg = Uj. 


Setting v2 = (a,b,c) we obtain 


a | a 0 
2 ele 2 b)=!] 1 |, 
mse <a C =i 
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that is 


atrote=0 
—2a—b-—c=1 
2a+b+c=-—l. 
This system has a solution a = —1, b= 0 and c= 1. Hence, 
—1 
Vg = 0 5 
a 


and the third solution is 


—1 
3 (t) = e’ (tv; + v2) =e t 
1-t 
Finally, the general solution is 
Ce" _ Cre! 
x(t) = Cyry + Corq + C3x3 = | —2Cye* + (C2 + Cot) e! 


2Cie” + (C3 Co Ss C3t) et 


4 Qualitative analysis of ODEs 


4.1 Autonomous systems 


Consider a vector ODE 
a ap a) (4.1) 
where the right hand side does not depend on t. Such equations are called autonomous. 


Here f is defined on an open set 2 C R” (or 2 Cc C") and takes values in R” (resp., C”), 
so that the domain of the ODE is R x . 


Definition. The set 2 is called the phase space of the ODE and any path x: I — Q, 
where x (t) is a solution of the ODE on an interval J, is called a phase trajectory. A plot 
of all phase trajectories is called a phase diagram or a phase portrait. 


Recall that the graph of a solution (or the integral curve) is the set of points (t, x (t)) 
in Rx Q. Hence, a phase trajectory can be regarded as the projection of an integral curve 
onto 22. 

Assume in the sequel that f is continuously differentiable in Q. For any y € 0, denote 
by x(t, y) the maximal solution to the IVP 


ol, 


Recall that, by Theorem 2.14, the domain of function x (t,y) is an open subset of R”+! 
and x (t,y) is continuously differentiable in this domain. 
The fact that f does not depend on t, implies the following two consequences. 
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1. If x(t) is a solution of (4.1) then also x (t — a) is a solution of (4.1), for any a € R. 
In particular, the function x (t — to, y) solves the following IVP 


(fais 


2. If f (vo) = 0 for some x € 2 then the constant function x (t) = xp is a solution of 
x' = f (x). Conversely, if x(t) = xo is a constant solution then f (xo) = 0. 


Definition. If f (zo) = 0 at some point xp € 2 then zp is called a stationary point'* of 
the ODE az’ = f (x). 

It follows from the above observation that if x is a stationary point if and only if 
x(t, Xo) = Xo. 
Definition. A stationary point 2x is called Lyapunov stable for the system x’ = f (x) (or 
the system is called stable at xo) if, for any ¢ > 0, there exists 6 > 0 with the following 
property: for all y € 2 such that ||y — xo|| < 6, the solution 2 (t, y) is defined for all t > 0 
and 


sup ||x(t,y) — xol| <e. (4.2) 
tE(0,+00) 


In other words, the Lyapunov stability means that if x (0) is close enough to x9 then 
the solution x (t) is defined for all t > 0 and 


«x (0) € B(ao,6) = > x(t) € B(x, ¢) for all t > 0. 


If we replace in (4.2) the interval (0,-+co) by any bounded interval [a, b] containing 0 then 
by the continuity of x (t, y), 


sup ||z (t,y) — xoll = sup ||z (¢,y) — x(t, 0) || > 0 as y > 20. 
tE [a,b] te [a,b] 


Hence, the main issue for the stability is the behavior of solutions as t — ++oo. 


Definition. A stationary point xo is called asymptotically stable for the system x’ = f (x) 
(or the system is called asymptotically stable at xo), if it is Lyapunov stable and, in 
addition, 

I|x (t, y) — Lol| — 0 as t > +00, 


provided ||y — xo]| is small enough. 


Observe, the stability and asymptotic stability do not depend on the choice of the 
norm in R” because all norms in R” are equivalent. 


“Tn the literature one can find the following synonyms for the term “stationary point”: rest point, 
singular point, equilibrium point, fixed point. 
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4.2 Stability for a linear system 


Consider a linear system x’ = Ax in R” where A is a constant operator. Clearly, x = 0 is 
a stationary point. 


Theorem 4.1 /f for all complex eigenvalues of A, we have ReA < 0 then 0 is asymp- 
totically stable for the system x’ = Ax. If, for some eigenvalue of A, ReX > 0 then 0 
is unstable. 


Proof. By Theorem 3.19’, the general complex solution of x’ = Ax has the form 
x(t)= > Cre Py (t), (4.3) 
k=1 


where C; are arbitrary complex constants, \1, ..., An, are all the eigenvalues of A listed with 
the algebraic multiplicity, and P, (t) are some vector valued polynomials of t. The latter 
means that P, (t) = uy + ugt +... + ust® + for some s € N and for some vectors wy, ..., Us- 
Note that this solution is obtained by taking a linear combination of mn independent 
solutions e**' P, (t). Since 


we see that the coefficients C;, are the components of x (0) in the basis {P, (0)}7_,. 
It follows from (4.3) that 


n 


lz @Il < $0 |Cre*| ILPe Ol 
k=1 
< max |O,|e®"S 7 ||P. (EI 


k=1 


Set 
a= max Re A, <0. 


Observe that the polynomials admits the estimates of the type 
Pe Ol < C(1+t%) 
for all t > 0 and for some large enough constants C' and N. Hence, it follows that 
lz I < Ce (1+ 4%) Ix (ODI, (4.4) 


Clearly, by adjusting the constant C’, we can replace ||x (0)||,, by || (0)]]. 
Since the function (1+ +) e% is bounded on (0,+00), we obtain that there is a 
constant AK such that, for all t > 0, 


I|z (|| < 4 |x (O)I), 
whence it follows that the stationary point 0 is Lyapunov stable. Moreover, since 


(14+ t%)e% +0 as t— +00, 
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we conclude from (4.4) that ||x (t) || —~ 0 as t — oo, that is, the stationary point 0 is 
asymptotically stable. 

Let now Re A > 0 for some eigenvalue X. To prove that 0 is unstable is suffices to show 
that there exists an unbounded real solution x (t), that is, a solution for which ||x (¢)|| 
is not bounded on (0,-+00) as a function of t. Indeed, if such a solution exists then the 
function ex (t) is also an unbounded solution for any ¢ > 0, while its initial value ex (0) 
can be made arbitrarily small by choosing ¢ appropriately. 

To construct an unbounded solution, consider an eigenvector v of the eigenvalue A. It 
gives rise to the solution 

x (t) = ey 


for which 
||x (¢)|| = fe™*| [lvl] = eB lla] . 


Hence, || (t)|| is unbounded. If x(t) is a real solution then this finishes the proof. In 
general, if x(t) is a complex solution then then either Rex (t) or Imz (t) is unbounded 
(in fact, both are), whence the instability of 0 follows. = 

This theorem does not answer the question what happens when Re = 0. We will 
investigate this for the case n = 2 where we also give a more detailed description of the 
phase diagrams. 

Consider now a linear system x’ = Az in R? where A is a constant operator in R?. Let 
b = {bi, by} be the Jordan basis of A so that A® has the Jordan normal form. Consider 
first the case when the Jordan normal form of A has two Jordan cells, that is, 


» {AX 9 
Rete ae 


Then 6; and by are the eigenvectors of the eigenvalues A; and A», respectively, and the 
general solution is 
ett) = Cye™!b, + Cye??*bo. 


In other words, in the basis 8, 
(= (Cie™, Coe") 
and x (0) = (Ci, C2). It follows that 
I|z (¢)||_, = max (|Cie**| : |Coe*]) = max (|C4| eteMe, |C| gra) < ||x (0) lloe™ 
where 


a@ = max (Re Ay, Re A2). 


If a < 0 then 
IIx (€) loo < Ile (0) | 


which implies the Lyapunov stability. As we know from Theorem 4.1, if a > 0 then the 
stationary point 0 is unstable. Hence, in this particular situation, the Lyapunov stability 
is equivalent to a < 0. 

Let us construct the phase diagrams of the system x7’ = Ax under the above assump- 
tions. 

Case A, A2 are real. 


120 


Let x; (t) and x2 (t) be the components of the solution « (t) in the basis {b), b2}. Then 
t= Cie and xo = Cre". 
Assuming that 1, 42 4 0, we obtain the relation between x, and x2 as follows: 
to = Cla", 


where y = A2/A1. Hence, the phase diagram consists of all curves of this type as well as 
of the half-axis x7, > 0,2; < 0,22 > 0,22 < 0. 

If y > 0 (that is, A; and A» are of the same sign) then the phase diagram (or the 
stationary point) is called a node. One distinguishes a stable node when 1, Az < 0 and 
unstable node when 1, A2 > 0. Here is a node with +y > 1: 


y LT 


-0.5 7 


and here is a node with y = 1: 


ia 


If one or both of Ay, Az is 0 then we have a degenerate phase diagram (horizontal or vertical 
straight lines or just dots). 
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If y < 0 (that is, Ay and Ag are of different signs) then the phase diagram is called a 
saddle: 


Of course, the saddle is always unstable. 
Case \, and 2 are complex, say 4; = a— 78 and Ax =a+if with 6B #0. 
Then we rewrite the general solution in the real form 


x(t) =C,Re omy, Ae, Teh. 


Note that 6b; is an eigenvector of A, and, hence, must have a non-trivial imaginary part 
in any real basis. We claim that in some real basis b; has the form (1,7). Indeed, if 
b; = (p,q) in the canonical basis e), e2 then by rotating the basis we can assume p,q # 0. 
Since b; is an eigenvector, it is defined up to a constant multiple, so that we can take 
p=1. Then, setting q = q, + iq2 we obtain 


by = e1 + (1 + tga) e2 = (€1 + Me2) + iqnes = Cf + ey 


where e) = e1 + qé2 and e5 = qe2 is a new basis (the latter follows from the fact that q 
is imaginary and, hence, q. # 0). Hence, in the basis e’ = {e}, e,} we have b; = (1,7). 
It follows that in the basis e’ 


F at BAT as 
elo P'b, = e (cos Bt + isin Bt) ( ; ) = ( e*" cos Bt — te sin Bt ) 


e% sin Bt + ie cos Bt 


and 
a e™ cos ft —e“ sin Bt \ _ e™ cos (Gt + W) 
ea ( e™ sin St ) ee ( e* cos Bt ) 7 o( e* sin (Bt +) ) 


where C' = \/C? + C? and 


= inh 
C’ C 


If (r,@) are the polar coordinates on the plane in the basis e’, then the polar coordinates 
for the solution x (t) are 


cos = 


r(t)=Ce™ and 6(t) = Bt+y. 
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If a ~ 0 then these equations define a logarithmic spiral, and the phase diagram is called 
a focus or a spiral: 


0.75 7 


0.5 7 


0.5, ORS WY) 0J25 05 0.75 1 


-0.5 7 


The focus is stable is a < 0 and unstable if a > 0. 

If a = 0 (that is, the both eigenvalues A, and A are purely imaginary), then r (t) = C, 
that is, we get a family of concentric circles around 0, and this phase diagram is called a 
center: 


In this case, the stationary point is stable but not asymptotically stable. 
Consider now the case when the Jordan normal form of A has only one Jordan cell, 


that is, 
oe oe 
aa (31). 


In this case, A must be real because if \ is an imaginary root of a characteristic polynomial 
then A must also be a root, which is not possible since \ does not occur on the diagonal 
of A. Then the general solution is 


x (t) = Qe + Cue (byt + by) = (Ci af Cot) eb, wa Che be 
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whence x (0) = C,b; + Cob. That is, in the basis b, we can write x (0) = (Ci, C2) and 
x (t) = tee (Ci + Cot) : eC) (4.5) 
whence 
eeu eC, + Cot| + e** Cy]. 


If \ < 0 then we obtain again the asymptotic stability (which follows also from Theorem 
4.1), while in the case \ > 0 the stationary point 0 is unstable. Indeed, taking C; = 0 
and C2 = 1, we obtain a particular solution with the norm 


Iz @)lh=e% E+), 


which is unbounded. 
If X # 0 then it follows from (4.5) that the components 21,22 of x are related as 
follows: 


2, Cy 1, 2 
—=—4t d t=~-lIn— 
HOD) Cy = a Xr . C 
whence 
Ly = Cx + mis al 


for some constant C’. Here is the phase diagram in this case: 


A> 


This phase diagram is also called a node. It is stable if X < 0 and unstable if \ > 0. If 
= 0 then we obtain a degenerate phase diagram - parallel straight lines. 

Hence, the main types of the phases diagrams are the node (Aj, A2 are real, non- 
zero and of the same sign), the saddle (A;, Az are real, non-zero and of opposite signs), 
focus/spiral (1, A2 are imaginary and Re A ¥ 0) and center (A1, Az are purely imaginary). 
Otherwise, the phase diagram consists of parallel straight lines or just dots, and is referred 
to as degenerate. 

To summarize the stability investigation, let us emphasize that in the case Re A = 0 
both stability and instability can happen, depending on the structure of the Jordan normal 
form. 
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4.3 Lyapunov’s theorem 


Consider again an autonomous ODE 2’ = f(a) where f : Q — R” is continuously 
differentiable and 2 is an open set in R”. Let x be a stationary point of the system 
x' = f (x), that is, f (vo) = 0. We investigate the stability of the stationary point 29. 


Theorem 4.2 (Lyapunov’s theorem) Assume that f € C?(Q) and set A = f' (xo) (that 
is, A is the Jacobian matrix of f at xo). If ReA < 0 for all eigenvalues X of A then the 
stationary point xo is asymptotically stable for the system a’ = f (x). 


Remark. This theorem has the second part that says the following: if ReA > 0 for 
some eigenvalue \ of A then xo is unstable for x’ = f (x). However, the proof of that is 
somewhat lengthy and will not be presented here. 


Example. Consider the system 


{ gw! = /44+ 4y — 2e7t¥ 


y =sin3z + In(1—4y). 
It is easy to see that the right hand side vanishes at (0,0) so that (0,0) is a stationary 


point. Setting 
4+ 4y — Qer™t¥ 
oo eee ii 


sin 3x + In (1 — 4y) 


Oy f 1 a} ne 1 —2 —-l 
ASF (C0) = - = : 
F (0,0) ( Or fz Oy fa 3. 4 
Another way to obtain this matrix is to expand each component of f (x,y) by the Taylor 
formula: 


we obtain 


Filey) = 2 T+ y—2e¥ =2(14+5 +0(2)) 2(1+ (x+y) +0(|z| + |yl)) 
= —2x—y+o(la| + lyl) 
and 
fo(x,y) = sin3x¢+In(1—4y) = 32+ 0(a) — 4y+o(y) 
= 30 —4y + o((2| +9). 
Hence, 


saw)=( 3 24) (5 ) telethon, 


whence we obtain the same matrix A. 
The characteristic polynomial of A is 


= er a eee 
det ( 3 me +6A+ 11, 


and the eigenvalues are 


Mg = —3 +iV2. 
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Hence, Re < 0 for all A, whence we conclude that 0 is asymptotically stable. 


The main tool for the proof of theorem 4.2 is the following lemma, that is of its own 
interest. Recall that for any vector v € R” and a differentiable function F’ in a domain in 
R”, the directional derivative 0,F can be determined by 


Lemma 4.3 (Lyapunov’s lemma) Consider the system x' = f (x) where f € C1 (Q) and 
let ro be a stationary point of it. Let V (x) be aC! scalar function in an open set U such 
that %) € U CQ and the following conditions hold: 
1. V(x) >0 for any x € U \ {xo} and V (ao) = 0. 
2. For alla € U, 
Of(a)V (x) <0. (4.6) 


Then the stationary point xo is stable. 
Furthermore, if all x € U 
Os(nyV (2) < —W (a), (4.7) 


where W (x) is a continuous function on U such that W (x) > 0 for x € U \ {xo}, then 
the stationary point x9 is asymptotically stable. 


Function V with the properties 1-2 is called the Lyapunov function. Note that the 
vector field f (x) in the expression O;(z)V (x) depends on x. By definition, we have 


BrV (0) = 2 5 (0) fal). 


In this context, 0;V is also called the orbital derivative of V with respect to the ODE 
oe eee 


Before the proof, let us show examples of the Lyapunov functions. 


Example. Consider the system x’ = Az where A € £(R"). In order to investigate the 
stability of the stationary point 0, consider the function 


V (e) = Ilellg = D0 ai, 
k=1 


which is positive in R” \ {0} and vanishes at 0. Setting f (2) = Az, we obtain for the 
components 


fie (@) = D0 Angry. 
j=l 


Since a = 2p, it follows that 


OrV = S- Ae we S- Agi Bi Lies 
k=1 


j,k=1 
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The matrix (A;,;) is called a non-positive definite if 


Ss" Apjtjxp <0 for all x € R”. 
j,k=1 
Hence, in the case when A is non-positive definite, we have 0;V < 0 so that V is a 
Lyapunov function. It follows that in this case 0 is Lyapunov stable. Matrix A is called 
negative definite if 
S- Axj2jt, <0 for all x € R” \ {0}. 
jk=1 
Then setting W (7) = — 0... Anjxjx, we obtain 0s;V = —W so that by the second 
part of Lemma, 4.3, 0 is asymptotically stable. 
For example, if A = diag (A1,..., An) then A is negative definite if all \, < 0, and A is 
non-positive definite if all A, < 0. 


Example. Consider the second order scalar ODE 2x” + ka’ = F(x) which describes 
the movement of a body under the external potential force F(x) and friction with the 
coefficient k. This can be written as a system 


a 
y =—ky+F (a). 


Note that the phase space is R? (assuming that F is defined on R) and a point (a, y) in 
the phase space is a couple position-velocity. 

Assume F' (0) = 0 so that (0,0) is a stationary point. We would like to answer the 
question if (0,0) is stable or not. The Lyapunov function can be constructed in this case 
as the full energy 


Vey) =L 4000), 


where 


is the potential energy and ve is the kinetic energy. More precisely, assume that k > 0 
and 
F(x)<0 for «>0, F(x)>O0forz <0, 


and set a 
Ghee i F (s) ds, 
Jo 
so that U (0) = 0 and U (x) > 0 for c £ 0. Then the function V (x, y) is positive away 
from (0,0) and vanishes at (0,0). 
Setting 
f (ey) =(y, ky + F(a), 
let us compute the orbital derivative 0;V: 
ye (hy + Fe) 
= yU" (x) + (-ky + F(x))y 
= -yF () — ky’ + F(a)y = —ky? <0. 


O;7V = 
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Hence, V is indeed the Lyapunov function, and by Lemma 4.3 the stationary point (0, 0) 
is Lyapunov stable. 

Physically this has a simple meaning. The fact that F' (x) < 0 for x > 0 and F' (x) > 0 
for x < 0 means that the force always acts in the direction of the origin thus trying to 
return the displaced body to the stationary point, which causes the stability. 

Proof of Lemma 4.3. By shrinking U, we can assume that U is bounded and that 
V is defined on U. Set 


B, = B(20,r) = {a ER”: la — 29ll <r} 


and observe that, by the openness of U, B: C U provided ¢ > 0 is small enough. For any 
such ¢, set 
m(e)= inf V(x). 
xeU\B- 
Since V is continuous and U \ B, is a compact set (bounded and closed), by the minimal 
value theorem, the infimum of V is taken at some point. Since V is positive away from 
0, we obtain m (e) > 0. It follows from the definition of m (e) that 


V (xz) >m(e) forall ce U\ B,. (4.8) 
Since V (a9) = 0, for any given ¢ > 0 there is 6 > 0 so small that 
V (a) <m/(e) for all a € Bs. 


Fix y € B; and let x(t) be the maximal solution in R x U of the IVP 


oc 


We will show that x(t) € B, for all t > 0, which means that the system is Lyapunov 
stable at xo. 
For any solution x (t) in U, we have by the chain rule 


EV (o(t)) =V" (a) 2! (f) = V" (2) F (2) = Oye¥ (2) <0 (4.9) 
Therefore, the function V is decreasing along any solution x (t) as long as x (t) remains 
inside U. 

If the initial point y is in Bs then V (y) < m(e) and, hence, V (x (t)) < m(e) for t > 0 
as long as x(t) is defined in U. It follows from (4.8) that x(t) € B.. We are left to verify 
that x(t) is defined!’ for all t > 0. Indeed, assume that x (t) is defined only for t < T 
where T is finite. By Theorem 2.8, if t ~ T—, then the graph of the solution x (t) must 
leave any compact subset of R x U, whereas the graph is contained in the set [0,7] x Bz. 
This contradiction shows that T’ = +00, which finishes the proof of the first part. 

For the second part, we obtain by (4.7) and (4.9) 


£V (x(t) <-W(e(t)). 


15Since a (t) has been defined as the maximal solution in the domain R x U, the solution x (t) is always 
contained in U as long as it is defined. 
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It suffices to show that 
V (x (t)) ~ 0 as t— co 


since this will imply that x(t) — 0 (recall that 0 is the only point where V vanishes). 
Since V (a (t)) is decreasing in t, the limit 


C= lim V(a(t)) 


t—+00 


exists. Assume from the contrary that L > 0. Then, for all t > 0, V (a (t)) > L. By the 
continuity of V, there is r > 0 such that 


V (y) < L for all y € B,. 
Hence, x(t) ¢ B, for all t > 0. Set 


m= inf W > 0. 
a (y) 


It follows that W (a (t)) > m for all t > 0 whence 


SV (x(t) < -W(2(t)) <-m 


for all t > 0. However, this implies upon integration in ¢t that 
V (w(t) < V (@ (0) — mt, 


whence it follows that V (x (t)) < 0 for large enough t. This contradiction finishes the 
proof. # 

Proof of Theorem 4.2. Without loss of generality, set 79 = 0. Using that f € C?, 
we obtain by the Taylor formula, for any component f;, of f, 


n 1 n 
Sic (@) = fi (0) + Doh (0) a+ 3 Oij fx (0) ei; +0 (\|a||’) as a — 0. 

Noticing that 0; f;, (0) = Ax; write 

f (2) = Av + h(a) 
where h (a) is defined by 

1 n 
he (@) = 5 Y_ dij fe 0) ex; +0 ([2I)’) . 
i,j=l 


Setting B = max;,;; 


O;; fx (0)|, we obtain 


2 2 2 
Ih (2)lloo = max |e (x)| S By ee +o (llal|") = Bllelly +0 (lla(") - 
oe 


Hence, for any choice of the norms, there is a constant C’ such that 


Ir (xl < C [al|? (4.10) 
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provided ||z/|| is small enough. 
Assuming that Re A < 0 for all eigenvalues of A, consider the following function 


va) = | er*alfas (4.11) 
J0 


and prove that V (a) is the Lyapunov function. 
Let us first verify that V () is finite, that is, the integral in (4.11) converges. Indeed, 
in the proof of Theorem 4.1 we have established the inequality 


lle“ 2| < Ce™ (t% +1) |Iz|], (4.12) 
where C’, N are some positive numbers (depending on A) and 
a = max Re A, 


where max is taken over all eigenvalues \ of A. Since by hypothesis a < 0, (4.12) implies 
that Ijes4a|| decays exponentially as s — +00, whence the convergence of the integral in 
(4.11) follows. 

Next, let us show that V (x) is of the class C1 (in fact, C%). For that, represent x in 
the canonical basis vj, ...,U, as © = )> xv; and notice that 


n 
Iza = So lai? = aa. 
i=1 


Therefore, 


Jle**x|[5 = Cae = (= Xi (e*s)) . (= ps *e)) 
i J 

> Bhs (e°40; . e*4v;) : 

tJ 


Integrating in s, we obtain 


7) 


where bj; = ie (e%4y; . e*4y,) ds are constants, which clearly implies that V (a) is infi- 
nitely many times differentiable in x. 

Remark. Usually we work with any norm in R”. In the definition (4.11) of V (x), we 
have specifically chosen the 2-norm to ensure the smoothness of V (x). 

Function V (x) is obviously non-negative and V (x) = 0 if and only if « = 0. In order 
to complete the proof of the fact that V (x) is the Lyapunov function, we need to estimate 
Op(2)V (x). Let us first evaluate 04,V (x) for any x € U. Since the function y (t) = ez 
solves the ODE 4’ = Ay, we have by (4.9) 


dayoV (u(t) = SV (uli). 


Setting t = 0 and noticing that y (0) = x, we obtain 


Og V(): = {Vv (ez) : (4.13) 


On the other hand, 


V (etn) = fe (ea) [a= f° fern pas = fo 
J0 J0 Jt 


where we have made the change t = s+t. Therefore, differentiating this identity in t, we 
obtain 


2 
e2||, dtr 


d 
ay (ez) ——— | e4|). 


Setting t = 0 and combining with (4.13), we obtain 


d 
OarV (a) = ZV (e“2)) = — lel. 
t=0 


Now we can estimate O;(2)V (a) as follows: 
Op(ayV (w) = OacV (@) + On@yV (x) 
= —|lallp +V' (x) -h(@) 
< —lelle +1’ loll @)lle, 


where we have used the Cauchy-Schwarz inequality u-vu < |lul|,||v||, for all u,v € R”. 
Next, let us use the estimate (4.10) in the form 


\ 


Ik (x)llp SC llalls , 


which is true provided ||z||, is small enough. Observe also that the function V (x) has 
minimum at 0, which implies that V’ (0) = 0. Hence, if ||z||, is small enough then 


ee 
IV’ (ll, < 507. 


Combining together the above three lines, we obtain that, in a small neighborhood U of 
0, 


1 1 
2 2 2 
Op(z)V (t) S — lalla + 5 Mella = —5 lela. 


Setting W (x) = $|l||3, we conclude by Lemma 4.3, that the ODE 2’ = f (x) is asymp- 
totically stable at 0. m 

Now consider some examples of investigation of stationary points of an autonomous 
system 2’ = f (x). 

The first step is to find the stationary points, that is, to solve the equation f (x) = 0. 
In general, it may have many roots. Then each root requires a separate investigation. 

Let x9 denote as before one of the stationary points of the system. The second step is 
to compute the matrix A = f’ (xo). Of course, the matrix A can be found as the Jacobian 
matrix componentwise by Aj = Oz, fr (0). However, in practice is it frequently more 
convenient to do as follows. Setting X = x — x, we obtain that the system x’ = f (x) 
transforms to 


X'= f(x) =f (to +X) =f (xo) + f' (vo) X + 0(||X|) 


as X — 0, that is, to 
X' = AX +0/(||X||). 


131 


Hence, the linear term AX appears in the right hand side if we throw away the terms of 
the order o (||X||). The equation X’ = AX is called the linearized system for x’ = f (x) 
at Xo. 

The third step is the investigation of the stability of the linearized system, which 
amounts to evaluating the eigenvalues of A and, possibly, the Jordan normal form. 

The fours step is the conclusion of the stability of the non-linear system x’ = f (x) using 
Lyapunov’s theorem or Lyapunov lemma. If Re A < 0 for all eigenvalues \ of A then both 
linearized and non-linear system are asymptotically stable at x9, and if Re A > 0 for some 
eigenvalue \ then both are unstable. The other cases require additional investigation. 


Example. Consider the system 
ve =ytry, 
{ = ihe ee (4.14) 


For the stationary points we have the equation 


yt+tary =0 
r+azy=0 


whence we obtain two roots: (x,y) = (0,0) and (x,y) = (—1,-1). 
Consider first the stationary point (—1,—1). Setting X¥ = x+1 and Y = y+1, we 
obtain the system 


{ X’=(Y-1)X =-X4+XY =-X +0(|| (X,Y) ||) 


Yi=-(X-N)Y¥=¥—-XY=¥ + o(ll(X,Y)I) oe 


whose linearization is 


Hence, the matrix is 


and the eigenvalues are —1 and +1 so that the type of the stationary point is a saddle. The 
linearized and non-linear system are unstable at (—1,—1) because one of the eigenvalues 
is positive. 

Consider now the stationary point (0,0). Near this point, the system can be written 


in the form 
{ v’ =yto(ll(z,y) Il) 
y' = —2£ + 0 (|| (x,y) II) 


w= y, 
y =-2. 


O-4 
4=(40) 


and the eigenvalues are +7. Since they are purely imaginary, the type of the stationary 
point (0,0) is a center. Hence, the linearized system is stable at (0,0) but not asymptot- 
ically stable. 


so that the linearized system is 


Hence, the matrix is 


132 BeatriceGloria_personal library 


For the non-linear system (4.14), no conclusion can be drawn just from the eigenvalues. 
In this case, one can use the following Lyapunov function: 


Via,y) =2—In(g+1)+y—-—h(yt+1]), 


which is defined for > —1 and y > —1. Indeed, the function « — In(a# +1) take the 
minimum 0 at « = 0 and is positive for « 4 0. It follows that V (2, y) takes the minimal 
value 0 at (0,0) and is positive away from (0,0). The orbital derivative of V is 


OPV S(O ty) OV Sr aay Oy 


(tary) trea (i-gb 


I 
— 
< 

| 


Hence, V is the Lyapunov function, which implies that (0,0) is stable for the non-linear 
system. 

Since O;V = 0, it follows from (4.9) that V remains constant along the trajectories 
of the system. Using that one can easily show that (0,0) is not asymptotically stable 
and the type of the stationary point (0,0) for the non-linear system is also a center. The 
phase trajectories of this system around (0,0) are shown on the diagram. 
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